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Abstract 

Given the return series for a set of instruments, a trading strategy is a switching function 
' that transfers wealth from one instrument to another at specified times. We present efficient 

q ' algorithms for constructing (ex-post) trading strategics that are optimal with respect to the 

total return, the Sterling ratio and the Sharpc ratio. Such ex-post optimal strategies are useful 
analysis tools. They can be used to analyze the "profitability of a market" in terms of optimal 
trading; to develop benchmarks against which real trading can be compared; and, within an 



> 

inductive framework, the optimal trades can be used to to teach learning systems (predictors) 
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which are then used to identify future trading opportunities. 



On 

§ ; 1 Introduction 

A trader has in mind the task of developing a trading system that optimizes some profit criterion, 
• i— i ■ 

^ ■ the simplest being the total return. A more conservative approach is to optimize a risk adjusted 

return. Widely followed measures of risk adjusted returns are the Sterling Ratio and Sharpe Ratio. 
In an enviroment where markets exhibit frequent crashes and portfolios encounter sustained periods 
of losses, the Sterling ratio and the Sharpe ratio have emerged as the leading performance measures 
used in the industry. Given a set of instruments, a trading strategy is a switching function that 
transfers the wealth from one instrument to another. In this paper, we consider the problem of 
finding optimal trading strategies, i.e., trading strategies that maximize a given optimality criterion, 
on historical data. In particular, we consider optimal strategies with respect to the total cumulative 
return, as well as with respect to various risk adjusted measures of return (the Sterling ratio and 
variants of the Sharpe ratio). Finding the optimal trading strategy for non-zero transactions cost 
is a path dependent optimization problem even when the price time series is known. A brute force 
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approach to solving this problem would search through the space of all possible trading strategies, 
keeping only the one satisfying the optimality criterion. Since the number of possible trading 
strategies grows exponentially with time, the brute force approach leads to an exponential time 
algorithm^, which for all practical purposes is infeasible - even given the pace at which computing 
power grows. The contribution in this work is to give efficient (polynomial time) algorithms to 
compute the optimal trading strategy for various profit objectives, with or without constraints on 
the number of trades that can ber made. Our motivations for constructing such optimal strategies 
are: 

(i) Knowing what the optimal trades are, one can take an inductive approach to real trading: on 
historical data, one can construct the optimal trades; one can then correlate various market 
and/or technical indicators with the optimal trades. These indicators can then be used to 
identify future trading opportunities. In a sense, one can try to learn to predict good trading 
opportunities based on indicators by emulating the optimal trading strategy. A host of such 
activity within the inductive framework, goes under the name of financial engineering. 

(ii) The optimal trading performance under certain trading constraints can be used as a bench- 
mark for real trading systems. For example, how good is a trading system that makes ten 
trades with a Sterling ratio of 4 over a given time period? One natural comparison is to 
benchmark this trading strategy against a Sterling-optimal trading strategy that makes at 
most ten trades over the same time period. 

(iii) Optimal trading strategies (with or without constraints) can be used to quantitatively rank 
various markets (and time scales) with respect to their profitability according to a given 
criterion. So for example, one could determine the optimal time scale on which to trade a 
particular market, or given a set of markets, which is the most (risk adjusted) profit-friendly. 

(iv) Given a stochastic model for the behavior of a pair of instruments, one can use the efficient 
algorithms presented here to construct ex-ante optimal strategies using simulation. To be 
more specific, note that the optimal strategy constructed by our algorithms requires full 
knowledge of the future price paths. The stochastic model can be used to generate sample 
paths for the instruments. These sample paths can be used to compute the optimal trading 
strategy given the current history and information set. One then has a sample set of future 
paths and corresponding optimal trading strategies on which to base the current action. Note 
that such a stochastic model for future prices would have to take into account correlations 
(including auto-correlations) among the instruments. 

The asymptotic running time of an algorithm is measured in terms of the input size n. If the input is a time 
sequence of n price data points, then polynomial time algorithms have run time that is bounded by some polynomial 
in n. Exponential time algorithms have running time greater than some exponentially growing function in n [6]. 
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It is beyond the scope of the current discussion to develop these applications. Our main goal here 
is to present the algorithms for obtaining optimal trading strategies, given a price time series. 



1.1 Trading Model 

We now make the preceeding discussion more precise. We consider optimal trading strategies on 
two instruments, for concreteness, a stock S and a bond B with price histories {So, . . . ,S n } and 
{Bo, . . . , B n } over n consecutive time periods, t{, i 6 {1, . . . , n). Thus, for example, over time period 
ti, the price of stock moved from Si-i to Si. We denote the return sequence for the two instruments 
by {s\, . . . , s n } and {b\, . . . , b n } respectively: Si = log ;<j- L p an d correspondingly, bi = log We 
assume that one of the instruments is the benchmark instrument, and that all the equity is held 
in the benchmark instrument at the begining and end of trading. The bond is usually considered 
the benchmark instrument, and for illustration, we will follow this convention. The trivial trading 
strategy is to simply hold onto bond for the entire duration of the trading period. It is useful to 
define the excess return sequence for the stock, Sj = Sj — 6j. When the benchmark instrument is the 
bond, the excess return as we defined it is the conventionally used one. However, one may want to 
measure performances of a trading strategy with respect to the S&P 500 as benchmark instrument, 
in which case the excess return would be determined relative to the S&P 500 return sequence. 
The excess return sequence for the bond is just the sequence of zeros, bi = 0. Conventionally, 
the performance of a strategy is measured relative to some trivial strategy, so the excess return 
sequence will be the basis of most of our performance measures. 

Definition 1.1 (Trading Strategy) A trading strategy T is a boolean n- dimensional vector in- 
dicating where the money is at the end of time period ti : 



1 if money is in stock at the end ofti, 
if money is in bond at the end of U. 



We assume that T[0] = T[n] = 0, i.e., all the money begins and ends in bond. A trade is entered 
at time ti if T[i] = and T[i + 1] = 1. A trade is exited at time ti if T[i] = 1 and T[i + 1] =0. 
The number of trades made by a trading strategy is equal to the number of trades that are entered. 

We make the following assumptions regarding the trading: 

Al [All or Nothing]: The position at all times is either entirely bond or entirely stock. 

A2 [No Market Impact]: Trades can be placed without affecting the quoted price. 

A3 [Fractional Market]: Arbitrary amounts of stock or bond can be bougnt or sold at any time. 

A4 [Long Strategies]: We assume that we can only hold long positions in stock or bond. 
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Assumption Al is in fact not the case in many trading funds, for it does not allow legging into a 
trade, or holding positions in both instruments simultaneously. While this is technicaly a restriction, 
for many optimality criteria (for example return optimal strategies) , one can show that there always 
exists an all-or-nothing optimal strategy. Thus, we maintain this simplifying assumption for our 
discussion. Further, such assumptions are typically made in the literature on optimal trading (see 
for example [T3] )• Assumptions A2 A4 are rather mild and quite accurate in most liquid markets, 
for example foreign exchange. Assumption A3 is needed for Al, since if all the money should be 
transfered to a stock position, this may necessitate the purchase of a fractional number of shares. 
Note that if T[i — 1] 7^ T[i], then at the begining of time period ti, the position was transfered from 
one instrument to another. Such a transfer will incur an instantaneous per unit transaction cost 
equal to the bid-ask spread of the instrument being transfered into. We assume that the bid-ask 
spread is some fraction (/# for bond and fs for stock) of the bid price. 

We denote the equity curve for a trading strategy T by the vector £7-, i.e., &r[i] is the equity 
at the end of time period ti, with £t[0] — 1- Corresponding to the equity curve is the excess return 
sequence r-y for the trading strategy T, i.e., for i > 1 

r r \i] =\og-^--h. (1) 

If we ignore the bid-ask spread, then the excess return in time period ti is given by 

r r \i] =SiT[i] = {si-bi)T[i}. (2) 

The bid-ask spread affects the return, reducing it by an amount depending on T[i — 1]. Denoting 
this transactions cost attributable to T[i] by we have that 

ah = -r\i - i](i - T[i\)f B - (1 - n - mm, (3) 

where fs = log(l+/s) and fs = log(l + /s). Thus, the bid-ask spread can be viewed as introducing 
an instantaneous return of — fs or —fs whenever the position is switched. To exactly which time 
period this transactions cost is applied may depend on the nature of the market, i.e., it may be 
applied to rf[i], rj-[i — 1] or rf[i + 1]. The nature of the results will not change significantly 
for either of these options, so in our algorithms, we will generally make the choice that offers the 
greatest technical simplicity. For a trading strategy T, we define the total return fJ>(T), the sum of 
the squared returns s 2 (T), the sum of squared deviations of the returns <J 2 {T) and the maximum 
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drawdown MDD(T) as follows, 



KT) = £rrH, (4) 

i=l 
n 

s\T) = J> r [f, (5) 

i=l 

^ 2 cn = E ^]--MTl = s 2 (r)-V(r), (6) 

MDD{T) = max -J> r [i]. (7) 

When it is clear from the context what trading strategy we are talking about, we will generally 
suppress the explicit dependence on T ■ The performance measures that we consider in this paper 
are derived from these statistics. In particular, we are interested in the total return fi, the Sterling 
ratio Strl, and variants of the Sharpe ratio, Shrpj^ and Shrp 2 : 

strl < r > = Jimkr ™ = f^ ™ = ^ < 8 > 

Shrp! is the conventionally used Sharpe ratio. Shrp 2 is a more risk averse performance measure, as 
it is more sensitive to the variance in the returns. Often, Strl as we have defined it is refered to as 
the Calmar ratio in the literature [8], and the Sterling ratio adds a constant (for example 10%) to 
the MDD in the denominator pQ. Such a constant can easily be accomodated by our algorithms, 
and so we will maintain this simpler definition for the Sterling ratio. 

The contribution of this paper is efficient algorithms for computing optimal trading strategies. 
We will use standard OQ notation in stating our results: let n be the length of the returns sequences; 
we say that the run time of an algorithm is 0(f(n)) if, for some constant C, the runtime is < Cf(n) 
for any possible return sequences. If f(n) is linear (quadratic), we say that the runtime is linear 
(quadratic). We will establish the following results. 

Theorem 1.2 (Return Optimal Trading Strategies) A total return optimal trading strategy 
can be computed in linear time. Specifically, 

i. Unconstrained Trading. A trading strategy 7^ can be computed in O(n) such that for any 
other strategy T, /u(7^) > ^(T). 

ii. Constrained Trading. A trading strategy making at most K trades can be computed in 
0(K ■ n) such that for any other strategy T K making at most K trades, fJt(T^) > fi(T K ). 

Proof: See section [2J ■ 
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Theorem 1.3 (Sterling Optimal Trading Strategies) A Sterling optimal trading strategy can 
be computed in near linear time. Specifically, 

i. Unconstrained Trading. A trading strategy Tstri can be computed in O(nlogn) such that 
for any other strategy T, Strl(7strl) > Strl(T). 

ii. Constrained Trading. A trading strategy T^ r \ making at most K trades can be computed in 
O(nlogn) such that for any other strategy T K making at most K trades, Strl(7s^i) > Strl(T^). 

Proof: See section [HJ ■ 

Theorem 1.4 (Sharpe Optimal Trading Strategies) A Sharpe optimal trading strategy can 
be computed in near quadratic time. Specifically, trading strategies Tshrp! an d 7shrp 2 can be found 
in 0(n 2 logn) such that for any other strategy T , Shrp 1 (7shrp 1 ) > Shrp 1 (7") and Shrp 2 (7shrp 2 ) > 
Shrp 2 (T) 

Proof: See section [H ■ 

In all cases, our proofs are constructive, and so immediately give the algorithms for performing the 
desired computations. Next, we discuss the existing related work, followed by a detailed discussion 
of the algorithms, along with all necessary proofs. 



Related Work 

The body of literature on optimal trading is so enormous that we only highlight here some repre- 
sentative papers. The reasearch on optimal trading falls into two broad categories. The first group 
is on the more theoretical side where researchers assume that instrument prices satisfy some partic- 
ular model, for example the prices are driven by a stochastic process of known form; the goal is to 
derive closed-form solutions for the optimal trading strategy, or a set of equations that the optimal 
strategy must follow. The main drawbacks of such theoretical approaches is that their prescriptions 
can only be useful to the extent that the assumed models are correct. Our work does not make any 
assumptions about the price dynamics to construct ex-post optimal trading strategies. 

The second group of research which is more on the practical side is focused on exploring data 
driven / learning methods for the prediction of future stock prices moves and trading opportunities. 
Intelligent agents are designed by training on past data and their performance is compared with 
some benchmark strategies. Our results furnish (i) optimal strategies on which to train intelligent 
agents and (ii) benchmarks with which to compare their performance. 
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Theoretical Approaches Boyd et al. in consider the problem of single-period portfolio 
optimization. They consider the maximization of the expected return subject to different types of 
constraints on the portfolio (margin, diversification, budget constraints and limits on variance or 
shortfall risk). Under certain assumptions on the returns distribution, they reduce the problem to 
numerical convex optimization. Similarily, Thompson in p3] considered the problem of maximizing 
the (expected) total cumulative return of a trading strategy under the assumption that the asset 
price satisfies a stochastic differential equation of the form dSt = dBt + h(Xt)dt, where Bt is a 
Brownian motion, h is a known function and Xj is a Markov Chain independent of the Brownian 
motion. In this work, he assumes fixed transaction costs and imposes assumptions Al, A2, A4 
on the trading. He also imposes a stricter version of our assumption A3: at any time, the trader 
can have only or 1 unit of stock. He proves that the optimal trading strategy is the solution of a 
free-boundary problem, gives explicit solutions for several functions h and provides bounds on the 
transaction cost above which it is optimal never to buy the asset at all. 

Pliska et al. in [3] and Bielecki et al. in [3] considered the problems of optimal investment 
with stochastic interest rates in simple economies of bonds and a single stock. They characterize 
the optimal trading strategy in terms of a nonlinear quasi-variational inequality and develop a 
numerical approaches to solving these equations. 

Some work has been done within risk-return frameworks. Berkelaar and Kouwenberg in [2] 
considered asset allocation in a return versus downside-risk framework, with closed-form solutions 
for asset prices following geometric Brownian motions and constant interest rates. Liu in [lUj 
consider the optimal investment policy of a constant absolute risk aversion (CARA) investor who 
faces fixed and proportional transaction costs when trading multiple uncorrelated risky assets. 

Zakamouline in [16] studies the optimal portfolio selection problem using Markov chain approx- 
imation for a constant relative risk averse investor who faces fixed and proportional transaction 
costs and maximizes expected utility of the investor's end-of-period wealth. He identifies three 
disjoint regions (Buy, Sell and No- Transaction) to describe the optimal strategy. 

Choi and Liu in [13] considered trading tasks faced by an autonomous trading agent. An au- 
tonomous trading agent works as follows. First, it observes the state of the environment. According 
to the environment state, the agent responds with an action, which in turn influences the current 
environment state. In the next time step, the agent receives a feedback (reward or penalty) from 
the environment and then perceives the next environment state. The optimal trading strategy for 
the agent was constructed in terms of the agent's expected utility (expected accumulated reward). 

Cuoco et al. in [7] considered Value at Risk as a tool to measure and control the risk of the 
trading portfolio. The problem of a dynamically consistent optimal porfolio choice subject to the 
Value at Risk limits was formulated and they proved that the risk exposure of a trader subject to 
a Value at Risk limit is always lower than that of an unconstrained trader and that the probability 
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of extreme losses is also decreased. 

Mihatsch and Neuneier in [12] considered problem of optimization of a risk-sensitive expected 
return of a Markov Decision Problem. Based on an extended set of optimality equations, risk- 
sensitive versions of various well-known reinforcement learning algorithms were formulated and 
they showed that these algorithms converge with probability one under reasonable conditions. 

Data Driven Approaches Moody and Saffell in [9] presented methods for optimizing portfolios, 
asset allocations and trading systems based on a direct reinforcement approach, which views opti- 
mal trading as a stochastic control problem. They developed reccurent reinforcement learning to 
optimize risk-adjusted investment returns like the Sterling Ratio or Sharpe Ratio, while accounting 
for the effects of transaction costs. 

Liu et al. in [15] proposed a learning-based trading strategy for portfolio management, which 
aims at maximizing the Sharpe Ratio by actively reallocating wealth among assets. The trading 
decision is formulated as a non- linear function of the latest realized asset returns, and the function 
cam be approximated by a neural network. In order to train the neural network, one requires a 
Sharpe-Optimal trading strategy to provide the supervised learning method with target values. In 
this work they used heuristic methods to obtain a locally Sharp-optimal trading strategy. The 
transaction cost was not taken into consideration. Our methods can be considerably useful in the 
determination of target trading strategies for such approaches. 

2 Return-Optimal Trading Strategies 

We use the notation [U,tj] to denote the set of time periods {U,ti + i, . . . ,tj}. In order to compute 
the return optimal strategies, we will use a dynamic programming approach to solve a more general 
problem. Specifically, we will construct the return optimal strategies for every prefix of the returns 
sequence. First we consider the case when there is no restriction on the number of trades, and 
then the case when the number of trades is constrained to be at most K. Although we maintain 
assumptions A1-A4 for simplicity, Al, A3 and A4 can be relaxed without much additional effort. 

2.1 Overview of the Algorithm 

The basic idea of the algorithm is to consider the optimal strategy to time period £j. This strategy 
must end in either stock or bond. Suppose that it ends in stock, then it must arrive at the final 
position in stock at ti by either passing through stock or bond at time t%—x- Thus, the optimal 
strategy which ends in stock at time ti must be either the optimal strategy which passes through 
stock at time followed by holding the stock for one more time period, or the optimal strategy 
which passes through bond at time and then makes a trade into the stock for the next time 
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period. Whichever is better among these two options yields the optimal strategy to time period 
U that ends in stock. A similar argument applies to the optimal strategy to time that ends in 
bond. Thus, having computed the optimal strategies which end in stock and bond to time £j_ 1, we 
can compute the optimal strategies which end in stock and bond to time tj. This induction can be 
propagated to obtain the final result. 

When there are constraints on the number of trades, we only need to slightly modify the above 
argument. We would like to compute the optimal strategy which ends (say) in stock and makes at 
most K trades. Any such strategy has to be one of two possibilities: it makes at most K trades 
ending in stock at time £»— 1, or it makes at most K — 1 trades, ending in bond at time U-\. If it 
ended in bond, it can only make at most K — 1 trades because one additional trade will be required 
to convert from bond at £j_i to stock at ij. Thus the inductive construction will start with K = 
which is to hold bond. Assuming we have computed all the optimal strategies for K = k to all 
times {ti}, we can then compute all the optimal strategies for K = k + 1 to all times. 

We now present the details and proofs of correctness for these two algorithms. 

2.2 Unconstrained Return-Optimal Trading Strategies 

First we give the main definitions that we will need in the dynamic programming algorithm to 
compute the optimal strategy. Consider a return-optimal strategy for the first m time periods, 
[ii,i m ]. Define S[m, 0] (<S[m, 1]) to be a return-optimal strategy over the first m periods ending 
in bond (stock) at time t m . For £ £ {0,1}, let fj,[m,£] denote the return of S[m,£] over [ii,t m ], 
i.e., fi[m,£] = fj,(S[m,£]). Let PREV[m,£] denote the penultimate position of the optimal strategy 
S[m,£] just before the final time period t m . 

The optimal strategy S[m, l\ must pass through either bond or stock at time period m — 1. 
Thus, «S[m, •£] must be the extension of one of the optimal strategies {<S[m — 1,0], S[m — 1, 1]} by 
adding the position i at time period t m . More specifically, 



In particular, S[m,£] will be the extension that yields the greatest total return. Using eq:ret and 
eq:tx, we have that 




n({S[m - 1, 0},£}) = /i[m -1,0]+ s m £ - f s £, 
n({S[m- 1,1], *}) =n[m- 1,1] +s m £- f B {\-£). 
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Since ^[m,£] is the maximum of these two values, we have the following recursion, 

fi[m,£] = max |/x[m - 1,0] + s m £ - f s £, fJ,[m - 1,1] + s m £ - f B (l - £)j . 

The position of the optimal strategy S[m, £} just before time period m is given by the ending 
position of the strategy that was extended. Thus, 



PREV[to,£] 



if n[m -1,0]+ s m £ - f s £ > (i[m -1,1] + s m £ - f B (l 

1 otherwise. 



If we already know fi[m — 1,0] and [i\m — 1, 1], then we can compute fj,[m,£] and PREv[m,£] for 
£ € {0, 1} in constant time. Further, we have that yu[l)0] — and /u[l, 1] = si — fs, and so, by a 
straight forward induction, we can prove the following lemma. 

Lemma 2.1 Prev[to,£] for all £ £ {0, 1} and m < n can be computed in 0(n). 

The optimal strategy 7^ is exactly S[n, 0]. PREv[n,0] gives the position at t n -i, and the optimal 
way to reach PREv[n,0] at t n -\ is given by optimal strategy S[n — 1, PREv[n, 0]]. Continuing 
backward in this fashion, it is easy to verify that we can reconstruct the full strategy 7^ using the 
following backward recursion: 

TM = 0, 

Tfj,[m] = PREV[m + l,7^[m + 1]], for 1 < m < n. 

Thus, a single backward scan is all that is required to compute T^i] for all i G {1, . . . ,n}, which 
is linear time, and so we have proved the first part of Theorem 11.21 Further, it is clear that the 
algorithm requires memory that is linear in n to store PREv[m,^]. While we have assumed that 
the algorithm works with excess returns, the optimal strategy does not depend on this assumption, 
thus the algorithm works correctly even with the actual return sequences. The generalization of 
this algorithm to N > 2 instruments is straightforward by suitably generalizing a trading strategy. 
S[m, £] retains its definition, except now £ E {0, . . . , N — 1}. To compute /u[m, £] will need to take a 
maximum over N terms depending on \x\m — 1,£'], and so the algirithm will have runtime O(Nn). 

One concern with the unconstrained optimal strategy is that it may make too many trades. It 
is thus useful to compute the optimal strategy that makes at most a given number of trades. We 
discuss this next. 
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2.3 Constrained Return-Optimal Strategies 

We suppose that the number of trades is constrained to be at most K. It is more convenient to 
consider the number of jumps k, which we define as the sum of the number of trades entered and 
the number exited. For a valid trading strategy, the number of trades entered equals the number 
of trades exited, so k = 2K. Analogous to S[m, £] in the previous section, we define S[m, k, £] to be 
the optimal trading strategy to time period t m that makes at most k jumps ending in instrument 
£. Let n[m,k,£] be the return of strategy S[m,k,£], and let PREV[m, k,l] store the pair (k',£'), 
where £' is the penultimate position of S[m,k,£] at t m -\ that leads to the end position £, and k' 
is the number of jumps made by the optimal strategy to time period t m -\ that was extended to 
S[m, k, £]. 

The algorithm once again follows from the observation that the the optimal strategy S[m,k,£] 
must pass through either bond or stock at t m -\. A complication is that if the penultimate position 
is bond and £ = 0, then at most k jumps can be used to get to thhe penultimate position, however, 
if £ = 1, then only at most k — 1 jumps may be used. Similarily if the penultimate position is stock. 
We thus get the following recursion, 

fi[m,k,0] = max |/i[m — 1, k, 0], fi[m — 1, k — 1, 1] — fs\ , 

fj,[m, k, 1] = max |/x[m - 1, k - 1, 0] + s m - fs, fJ,[m - 1, k, 1] + s m | . 

This recursion is initialized with fjt[m, 0,0] = and fJ,[m, 0, 1] = NULL for 1 < m < n. Once 
fj>[m, k, £] is computed for all m, £, then the above recursion allows us to compute fJ,[m, k + 1, £] for 
all m,£. Thus, the computation of /j,[m,k,£] for 1 < m < n, < k < 2K and £ S {0,1} can be 
accomplished in 0{nK). Once again, the strategy that was extended gives Prev[to, k,£], 

(k, 0) if n[m - 1, k, 0] > (j,[m — l,k — 1, 1] - f B , 
(k — 1,1) otherwise. 

(k - 1, 0) if fj,[m - 1, k - 1, 0] + s m - f s > n[m - 1, k, 1] + s m , 
(k, 1) otherwise. 

Since computing fj,[m, k, £] immediately gives PREV"[m, k,£], we have the following lemma, 
Lemma 2.2 PREv[m, k,£] for all £ € {0, 1}, m < n and k < 2K can be computed in 0{nK). 

is given by S[n, 2K, 0], and the full strategy can be reconstructed in a single backward scan 



PREV[m, k, 0] = 



PREV[m, k, 1] = 
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using the following backward recursion (we introduce an auxilliary vector k), 



Tf[n] = 0, 



{K[n-l],T«[n-l]) = PREV[n, 2K,T^[n]), 
' 1-1 



(«[rn], [m]) = PREV"[m + 1, «[rn + l],7/f [m + 1]), forl<m<n — 1. 



Since the algorithm needs to store PREv[m, k,£] for all m, k, the memory requirement is 0(nK). 
Once again, it is not hard to generalize this algorithm to work with N instruments, and the resulting 
run time will be 0{nNK). 

3 Sterling-Optimal Trading Strategies 

It will first be useful to discuss some of the MDD properties of the return-optimal strategy 7^, as 
these properties will have implications on our algorithm to determine Sterling-optimal strategies. 
For a strategy T, it is useful to define the cumulative return series, Cf[i] as the sum of the returns, 
Cj-[i] = Y^j=i r T[j]- Note that ^(7^) = C^fn] > C^fn] = /u(T) for any strategy 7". The equity 
curve is given by £f[i] = exp (Cj-[i] + Y7j=i bj \ ■ 

3.1 Upper bound on the Maximum drawdown of a Sterling Optimal Strategy 

First, we will upper bound MDD(T^). Intuitively, the MDD of the return optimal trading strategy 
can not be larger than the bid-ask spread. This bound will be useful because the MDD of the 7^ 
serves as an upper bound for the MDD of the Sterling-optimal strategy, 

Lemma 3.1 MDD{Tstr\) < MDD{%). 

Proof: By definition, jSS^) ~ A?W) for ^ T ' Thus < MDD{T St rO < ^^MDD(T) for 
any 7". Choosing T = 7^ and noting that /x(7strl) < ^(Tli), we obtain the desired result. ■ 

Since the cost (in terms of the cumulative return) of entering and exiting a trade is — (fs + /b), no 
segment of the optimal trading strategy 7^ should lose more than this in return. 

Lemma 3.2 For any i<j, C % [j] - C % [i] > -{fs + h)- 

Proof: Suppose, for contradiction, that for some i < j, Cq- [j] — C^Ji] < —(fs + fs)- By setting 
Tn[i + 1], . . . ,7ji\j] to be all equal to 0, it is easy to verify that the cumulative return of the strategy 
must increase, which contradicts the optimality of 7«. ■ 
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For technical convenience, we will assume that the transactions cost when entering a trade is 
assigned to the time period prior to the entry, and the transactions cost when exiting a trade is 
assigned to the time period after the trade. Note that just prior to entering, the position is and 
so it will now have a return of —fs, and just after exiting, the position is 0, and will now have a 
return of — fs- 

Let f sp = fs + fB- From Lemma 15^} no segment of the optimal strategy can lose more than f sp , 
and so this immediately gives an upper bound on MDD(Tn)- For the trivial strategy that makes 
no trades, the MDD is 0. If a strategy makes exactly one trade, then there is a drawdown of at 
least fs at the begining, and of at least fs at the end. If at least two trades are made, then there 
is a drawdown of at least f sp between the exit of one trade and the entry of another, and since 
the drawdown cannot exceed f sp , the MDD must therefore equal f sp . We thus have the following 
lemma. 

Lemma 3.3 (MDD of 7~ M ) If 7^ makes no trades, MDD(T^) = 0. If makes one trade, 
maxl/s, /b} < MDD(Tp) < f sp . If makes at least two trades, MDD(T P ) = f sp . 

note that if we relax assumption Al, then by legging into a trade, it may be possible to decrease 
the drawdown, in which case Lemma 13.31 would no longer be valid. We are now ready to discuss 
the 0(n log n) algorithms to obtain Sterling-optimal trading strategies. First we will consider 
unconstrained Sterling optimal strategies, and then we will require number of trades < K. 

3.2 Overview of the Algorithm 

Based on the results in the previous section, the main observation is that if the Sterling optimal 
strategy makes two or more trades, then its MDD will be the bid-ask spread, and hence it must 
be the return optimal strategy. Thus we only need to consider the case when the Sterling optimal 
strategy makes exactly one trade. A useful observatioin is that no optimal strategy will exit a 
trade in the middle of a sequence of positive returns or enter a trade in the middle of a sequence 
of negative returns. Thus the return sequence can be contracted by combining any sequence of 
positive returns into a single return, ans similarily any sequence of negative returns. Possible entry 
points ai correspond to the beginning of a positive return and exit points the end of a sequence of 
positive returns. 

The main task is to find the Sterling optimal strategy making exactly one trade. We show 
that this problem can be reduced to convex hull operations in 2-dimensions, and hence obtain an 
0(n\ogn) algorithm for constructing the optimal solution. 

In order to construct the Sterling optimal strategy which makes at most K trades, the basic 
idea is to start with the return optimal strategy whose trade intervals cannot be enlarged (maximal 
return optimal strategies). If this strategy makes at most K trades, then we are back to the 
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unconstrained case. Otherwise, we show that one can successively merge neighboring trades in the 
return optimal strategy to obtain a sterling optimal strategy. The main difficulty in the algorithm 
is to determine which trades to merge, and we will show that a greedy merging strategy is in fact 
optimal. 

3.3 Unconstrained Sterling-Optimal Strategies 

For a degenerate trading system with all returns equal to zero, we define its Sterling ratio as 1. The 
only trading system with a MDD of is a degenerate trading system, so with this definition, the 
Sterling ratio is defined for all possible trading systems. The computation of the Sterling-optimal 
trading system breaks down into three cases, according to the number of trades its makes: 

Sterling-optimal that makes zero trades. Sterling Ratio is 1. 

Sterling-optimal that makes one trade. The trading strategy contains a single interval of l's. 

Sterling-optimal that makes at least two trades. Any trading system that makes at least 
two trades has an MDD > f sp . Since MDD(T^) < f sp (Lemma I3.3p . 7^ has the smallest 
MDD among all such systems. Since it also has the highest total return, we conclude that if 
the Sterling-optimal system makes at least two trades, then 7stri = Tp,- 

The first case is trivially computed. The third case, i.e., the Sterling optimal strategy that makes 
at least two trades can be computed in linear time using the dynamic programming algorithm to 
compute Tp- If we also compute the Sterling-optimal system that makes exactly one trade, then, we 
solve our problem by taking the case with the maximum Sterling ratio. We now focus on finding 
the trading strategy that makes only one trade and has greatest Sterling Ratio among all such 
strategies. 

Let T be a strategy that makes exactly one trade. The trade interval is the interval of time 
periods, [ti,tj] on which T = 1, i.e., the trade interval is an interval of l's in the trading strategy. 
An elementary algorithm that considers all the 0(n 2 ) possible trade intervals, picking the best is 
a quadratic time algorithm. The remainder of this section is devoted to providing an algorithm 
which computes such a strategy in 0{n log n) time, which will complete the proof of the first part 
of Theorem 11.31 In fact the algorithm that we present is a much more general algorithm that 
computes the single interval that optimizes a general class of optimality criteria. This algorithm 
will be useful when we discuss the Sharpe-optimal strategy. 

Consider a consecutive sequence of time periods ti, ii+i, • • • , ij+fc where k > 1, with all the excess 
returns non-negative and the last one positive, i.e., Sj > 0, Sj+i > 0, . . . , §i + k > 0. 

Lemma 3.4 Either the optimal single trade interval does not intersect these time periods, or an 
optimal single interval can be chosen to contain this interval. 
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Proof: Suppose that T[i + j] = 1 and T[i + j + 1] =0 for some < j < k. Extend the trading 
interval by setting 7"[i+ j + 1] = 1, • • • , T[i + k] = 1, which adds positive return, without increasing 
the MDD, contradicting the optimality of the original interval. On the other hand, suppose that 
T[i + j] = 1 and T[i+j — 1] = for some < j < k. Once again, by extending the trading interval, 
setting T[i] = 1, . . . ,T[i+j] = 1, we add non-negative returns, without increasing the MDD hence 
this new interval is at least as good as the previous interval. ■ 

A similar result holds for a sequence of consecutive negative time periods, tj,ij + i, . . . , ij+fc where 
k > 1, with Si < 0, s~i+i < 0, . . . , Sj+fc < 0. If an optimal trading interval only intersects part of 
these time periods, this intersection can be removed without decreasing the Sterling ratio. Thus, 
by Lemma 13.41 any sequence of time periods with all returns non-negative (non-positive) can be 
condensed into a single time period, t\ = t{ + • • • + t i+ ^, with s[ = Si + ■ ■ ■ + Sj+fc. Further, this 
operation can be performed in linear time on the entire excess return sequence, so from now on 
we assume without loss of generality that the excess return sequence consists of alternating time 
periods of strictly positive and negative excess returns. If Si < 0, then ti cannot be the first 1 of 
a trade, since by entering one time period later, we exclude only this negative return and do not 
increase the MDD. Similarily, it cannot be the last 1 of a trade. 

Lemma 3.5 The first 1 and the last 1 of the optimal trade interval must occus at time periods tf 
and t[ for which Sf > and §i > 0. 

The pictorial illustration of this lemma is given below where we show the cumulative return curve. 
The time instants ai are the possible entry points, and the time instants 6j are the possible exit 
points. 



Let the alternating sequence of entry and exit points be {a\, b±, a2, 62, . . . , a^, bu] {ai are the entry 
points, and bi are the exit points). Note that after the preprocessing into alternating intervals, 
k < \ n/2 ] . Notice that without loss of generality, we can throw away the first interval if it has a 
negative return, as it can never be an entry point, and the last interval if it has a negative return, 
for a similar reason. The optimal trade interval will be of the form (at, &t+r) ; r > 0. 

Our algorithm for finding the Sterling-optimal interval will be to consider every possible starting 
point ai, and find the Sterling-optimal interval with this point as starting point (i.e. we have to 
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find the end point of this interval). As the algorithm proceeds, we keep track of the best entry 
point (and its corresponding exit point). The entry points at are processed from right to left. 
After processing a new entry point at, we will modify the alternating sequence to facilitate faster 
processing of the remaining points. More specifically, we will delete the processed entry point and 
add a weight to the edge between bt-i and bt to preserve all the necessary information - we cannot 
simply delete the entry point at, since we have to keep track of maximum MDD that occurs in our 
activity interval. Since between and at we have a drawdown of bt-i — at, we need to keep this 
information in an edge weight connecting bt~\ to bt- At any stage of the algorithm, the edge weight 
connecting bt~\ to bt will be equal to the MDD of the interval [bt-i, bt], and this MDD is realized on 
a prefix of \bt-\,bt\, i.e. MDD([bt~i,bt]) = C{bt-\) — C{x), for some x S [b t -i,b t ]. This property 
is an invariant of the algorithm, which we will denote by (*), and it will be maintained throughout 
the algorithm. We will explicitly show this weight attached to bt in parentheses as {wtjbt, where 
the value of wt appearing in parentheses indicates the weight. 

We start our backward scan at the last entry point, a k , for which there is only one possible 
interval (a k ,bk)- We set the weight 

Wk = h-i ~ a k , 

which is the drawdown of the interval from b k _\ to b k . We also store the current best interval 

(a k ,b k , Strife), 

where Strife = bk a | ^ sp ; f sp = fs + fB', and, / = max{/g, We now delete the possible entry 
point a k from the sequence to give the processed sequence {a\, b\, a^-i, bfc-i, (^fc)&fc}- Note that 
(cife, bk, Strife) is a one-step trade, but we keep it here for simplicity. We now proceed to processing 
afe_i and so the process will continue until all aj have been processed. 

In the general case, suppose we have processed (backwards) all the entry points up to (including) 
the entry point a t +\, and are currently processing entry point a t . The weighted exit sequence is 

{ai, h, a t , b t , (w t+ i)b t+ i, (w t+m )bt+ m } , 

where bt, ■ ■ ■ , bt+ m are the possible exit points - note that t + m may not equal k due to possible 
deletion of points which we discuss below. Assume that {&t+i < . . . < 6<+ m }: this is true after 
we have processed the first start point (since the sequence consists only of one point), and we 
will maintain this condition by induction. If bt < bt+i, then the entire sequence of exit points is 
monotonically increasing. On the other hand, suppose that bt > &t+i- Then, 

return(at,b t ) > return{at,bt+\), and MDD(at,b t ) < MDD(at,b t +i), 
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and so bt+i need not be considered as an exit point for any optimal interval with entry point at or 
earlier, because by stopping earlier at bt, we do not decrease the cumulative return, nor increase the 
MDD. Hence, we can delete the possible exit point bt+i- However, we must now update the weight 
in (wt+2)bt+2 to store the new drawdown between b t and bt+2, wt+2 <— max{wt+i, Wt+2 + b t — b t +i). 

Lemma 3.6 Ifbt > bt+i, the weighted exit sequence is updated as follows: 

at, b t , (w t+ i)b t+ i, (w t+ m)bt+ m -> a t , b t , (max{w t+ i , w t+ 2 + h - b t+ i})b t+ 2, • • • , (w t+m )b t+m 

The correctness of the weight updating rule above follows from the invariant (*) which is preserved 
by the transformation. 

This process is continued until the next exit point after bt is either above bt or there are no 
remaining exit points after bt- In either event, the new sequence of exit points available for at is 
strictly monotonically increasing (by induction). Observe that any deletion of a possible exit point 
is a constant time operation. Further, since each deletion drops a point from the set {bi, . . . , b^}, 
there can be at most k — 1 such deletions during the course of the entire algorithm. We thus have 
the following lemma. 

Lemma 3.7 When at is processed by the algorithm, the exit points bt < bt+i < ■■• are monotoni- 
cally increasing. The cost of maintaining this condition for the entire algorithm is 0{k) operations. 

When processing at, the weighted exit sequence is {a±, b±, a t , b t , (wt+i)bt+i, ■ ■ ■ , (wt+ m )bt+ m } ■ 
Suppose that w t +2 < < • • • < w t + m . Initially this sequence is the empty sequence and so this 
condition holds, and once again, by induction we will ensure that this condition will always hold. 
Suppose that wt+i > w t +2- By construction, bt+2 > &t+i (Lemma 13. 4p . and so 

return(a t ,bt+i) < retum(a t ,bt + 2), and MDD(a t ,b t+ i) = MDD(a t ,b t+ 2)- 

Thus, no optimal interval can have entry point at (or earlier), and exit at bt+i, because by exiting 
at bt+2, the MDD is not increased, however the total return is increased. Thus if wt+i > wt+2, we 
can remove the possible exit point bt+i from the weighted exit sequence and update wt+2 Wt+i- 
Note that this transformation also preserves the invariant (*). 

Lemma 3.8 If wt+i > wt+2, the weighted exit sequence is updated as follows: 

at, h, {w t +i)bt+i, ... , {wt+m)bt+ m -> a t , b t , (w t +i)b t +2, ■■■ , {w t + m )bt+m 

We continue removing exit points in this way until either there is only one weight left in the weighted 
exit sequence, or all the weights are strictly monotonically increasing (by induction). 
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Suppose that wt+i < /• In this case, we observe that bt cannot be the exit of an optimal interval 
with entry at— r , where r > 0. To see this note that if bt — ctt-r ~ fs < 0, then the return of this 
interval is negative and this interval cannot be an optimal interval. If bt — <H—r ~ fs > then since 
the interval already has MDD of at least /, so by continuing to we do not increase the MDD 
but strictly increase the return, hence it cannot be optimal to exit at bt- 

Lemma 3.9 Let 

at,h, (w t+ i)b t +i, ... , (w t+m )bt +m 

be the weighted exit sequence and letT G [t, . . . , t + m] be such an index that wt < f and > /• 
Then no Sterling- optimal interval that starts at at or earlier can exit at {bt, . . . ,br-i}- 

Lemma 3.10 When at is processed by the algorithm, the weights Wt+i are monotonically increas- 
ing. The cost of mainitaining this condition for the entire algorithm is O(k) operations. 

We thus assume from now on that when processing entry point at, the weighted exit sequence 
{oi, b\, at, bt, (wt+i)bt+i, ■ ■ ■ , {wt+m)bt+m\ with m > satisfies the conditions of Lemmas l3.7l and 
13.101 The first available exit gives the trade interval (at,bt). If bt — at — f sp < 0, i.e., if the return 
is not positive, then this cannot possibly be an optimal interval. Otherwise, the Sterling Ratio is 

stri, = bt ~y^ , 

where f sp = fs + Ib and / = maxj/g, /#}. Now consider the exit points br, where T > t and 
suppose that &t — wt+i < a t . In this case, for r > 0, 

return(at,bT+ r ) < return(aT+i,br+r), 

and so no trade with entry at at, exiting at bx+ r can possibly be optimal, since we could always 
enter later at &t — wt+i, exit at &T+r ; and strictly increase the return without increasing the 
drawdown. We are thus done with processing the entry point at, and we can proceed to at—\ after 
updating weight w% and comparing bT a j ^ sp with the current champion. Similarly, if 6j — < at 
for some t € [t, . . . ,T — 1], we are done with the starting point a t , and we can proceed to a t -i 
after updating weight wt- We assume that at any stage of the algorithm we keep the value of 
i r nin'telt,...,T-i]bt ~ w t+i an d thus this check can be done in constant time for any given point at- 
Thus, without loss of generality, we can assume that bx — wt+i > at and bf — wj + i > at for all 
i 6 [t, . . . ,T — 1]. Since wt+i > f, we conclude that bx — at > f ■ A trade, entering at a t and 
exiting at br+ r , r > has total return &r+r — at — f sp - The next lemma gives the MDD. 
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Lemma 3.11 Assume that bx — wt+i > a t and bj — wi+\ > at for all t £ [t, . . . , T — 1] . The trade 
(a t ,bT+ r ), f > 0, has MDD = wt+t- 

Proof: The local maxima of the cumulative return sequence for this trade are {0, bt — at — 
fs, , ■ ■ ■ ,b T - a t - fs, ■ ■ ■ , b T+r -at- fs}- Since 6f - - a t > Vt G [t, . . . , T - 1], MDD of the 
interval [at, by] is equal to /g. 

Since bx — a t — fs > and since the sequence of exit points is strictly increasing, 
MDD([a t ,b T+r ]) = max(MDD([a t ,b T ]),MDD([b T ,b T+r ])) = max(f s , MDD([b T ,b T+r )), f B ) 
where fs is the draw down after the last point &T+r- 

Since the drawdown at any time in a trade is given by the the diffenence between the previous 
maximum and the current cumulative return, MDD([bT, &T+r)) is at most maxj g r 1)f .i it;t+i- Since 
the weights are monotonically increasing, we see that this drawdown < u>t+ r , which is achieved in 
the interval (&t+r- 1> h+r)- Since wt+ r > / = max(fs,fB) Vr > 0, we conclude MDD([at, &T+r]) = 

U^t+r- ■ 

Summary: For entry point at, the sequence of exit points bx+r, r £ [0,m] have cumulative 
returns c r = bT+r — at — f S p and MDD's d r = WT+r for r > and do = /. The task is to maximize 
c r /d r with respect to r. The sequences {c r } and {d r } are both strictly increasing. We now describe 
a general algorithm for performing such a maximization. 

3.3.1 Maximizing ^f- 

d r 

Fix t and consider the set of trades (at,bt+ r ), r > 0. Corresponding to each trade is the pair 
(d r ,c r ). As discussed in the previous section, by construction, the sequences {d r } and {c r } are 
strictly increasing. On the two dimensional plane, consider the set of points P m = {(d r , c r )}r l =o 
defined by this set of pairs. Let p = (0, 0). Then the slope of the line joining p to (d r , c r ) is exactly 
the Sterling ratio of the trade (at, 6 i+r ). Thus, finding the optimal trade is equivalent to finding the 
upper-touching point from p to the convex hull of the set of points in P m (see illustration below). 
We call p the source point. 




P d r 
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We get the same result if we define P m = {(d r , b t + r )}™=o, P = (0, a t + f sp ). It is known that, given 
the convex hull, this touching line can found in time 0(logA) where A is number of the points 
on the convex hull, [5] . It is easy to see that A < m + 1 < \ n/2~\. In order to compute this 
touching point in O(logvl) time, one must be able to efficiently search through the convex hull, we 
now describe the mechanism for facilitating this search. The details are not important. What is 
important is that this data structure can be constructed and efficiently maintained as we update 
the set {d r ,c r }. We will need to update this set as we process the points at backwards. 

We accomplish the fast construction of the upper tangent point of the convex hull by maintaining 
the convex hull as a doubly linked list, where each point in the convex hull maintains 0(logA) 
pointers to other points in the convex hull. More specifically, for point i we consider the set of 
points {(d r , c r )}J!Lj (point i, (di,Ci) is the first point in this set). Corresponding to this set is some 
convex hull Cj. Point i is also the first point in this convex hull (by the strict monotonicity of the 
d r ,c r ). Point i maintains a forward pointer to points at position 2? for j > 1 in the convex hull 
Ci of the points {(d r , c r )}'!^ =i . Clearly point i maintains O(logm) forward pointers. Each point 
also maintains backward pointers to any point that points forward to it. At point j, the backward 
pointers specific to the convex hull starting at point i < j are maintained separatly for each i so 
that constant time access to this set of pointers is possible. An array of an array of pointers suffices 
to accomplish this. It is clear that the worst case memory requirement is 0(m log m) pointers. 

We now discuss the main operations we would like to be able to do on our set of points {(d r ,c r )} 
and the point p and still be able to update our convex hull data structure efficiently, and hence 
efficiently compute the upper tangent line. First we recall that the d r are monotonically increasing. 
Assume that each point (d r ,c r ) stores all the necessary forward and backward pointers. Note that 
the point (d r , c r ) stores a forward pointer to the point nxt(r), which is the next point in the convex 
hull if all the points do, ... , d r -\ were removed - remember that in this case, d r becomes leftmost, 
and so must be in the convex hull. We will see that these assumptions are maintained inductively. 
We state the next observation lemma. 

Lemma 3.12 The initial convex hull with all the points is given by (do,co) followed by the points 
pointed to by nxt(0), nxt(nxt(0)), .... 

As we move from at to process at—x, the set of points {d r ,c r } will change. However, they do not 
change arbitrarily. As we will show, the changes can be reduced to combinations of the following 
four operations on the set of points P m . 

(i) Translate p by some given vector v. 

(ii) Translate all the points in {(d r ,c r )} by some given vector v. 

(iii) Remove the leftmost point (do,co). 
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(iv) Add a new leftmost point (gLi,c_i). 

If we can efficiently update our data structure after any of these four operations on the points in 
P m , then we will be able to compute the upper tangent point efficiently for the new set. This is the 
content of the next lemma. Since it is a rather technical lemma, we give the proof in the appendix. 

Lemma 3.13 Assuming that p, {(d r , c r , nxt r )}J!L 1 are given, all the operations in (l)-(4) above 
can be accomplished in time 0(log?n). Further, in the event that a point is added or deleted, all 
necessary pointers are maintained. 

We will now show that these four operations are all that is needed for maximizing the Sterling ratio. 
Suppose that point at has been processed in the algorithm - i.e., the upper tangent point (optimal 
trade with entry at at) from p t = (0,aj + f sp ) to P m = {(d r , bt+ T ) s ™ = Q has been computed. Now 
consider the addition at~\ to construct the new weighted exit sequence. First, delete the leftmost 
point (do,bT) from the convex hull (operation (hi)). Lets consider all posible operations that may 
take place when processinig Ot-i- There are several possibilities, corresponding to the cases which 
occur in the algorithm to ensure that the sequence {d r , c r } is strictly increasing. 

1- bt-i > bf We remove (leftmost) points bt+i (operation (hi)), i > 0, until bt~i < bt+i+i, and the 
new weight w' t+i+1 may have increased (Lemma 13. 6p . Deletion of points bt+i from the weighted 
exit sequence doesn't change the convex hull until bt+i is a possible exit point. After this point, 
deleting one point from the weighted exit sequence also deletes the corresponding leftmost point 
of the convex hull. At the very end of the sequence of deletions, we have to update the MDD 
of point bt+i+i from wt+i+i to w' t+i+1 , this can be done by deletion of the (leftmost) point 
(wt+i+i, bt+i+i) and addition of new (leftmost) point (w' t+i+1 , bt+i+i). The total number of such 
removals during the entire algorithm is at most n — 1. When condition bt~\ < bt is satisfied, 
proceed to the next stage. 

2. b t -i < b t . 

(i) wt+i < wt- We remove (leftmost) points bt+i, i > 0, until wt < wt+i+i- Deletion of points 
bt+i from the weighted exit sequence doesn't change the convex hull until bt+i is a possible 
exit point. After this point, deleting one point from the weighted exit sequence also deletes 
the corresponding leftmost point of the convex hull. By Lemma 13.101 the total number of 
such removals cannot exceed n — 1 over the course of the entire algorithm. When condition 
wt < wt+i is satisfied, proceed to the next stage. 

(ii) b t ^i < b t and wt < wt+i- 

(a) f > w t . Add to the convex hull the point (/, br)- 
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(b) / < w t . Add to the convex hull points (wt,bt) and (/, f>t-i). 

The new source point is Pt-i = (0, a t _i + f sp ), which just corresponds to a shift of p t , and so 
once the new convex hull for the new weighted exit sequence is computed, an additional O(logra) 
operations are needed to find the new upper tangent point. 

The total number of removals in the entire algorithm is 0(n). For each new entry point, we 
have at most a constant number of additions, and since the number of entry points is 0(n), we see 
that the total number of additions is 0{n). By Lemma 13.131 we have that each operation takes 
O(logn) in the worst case, thus the total run time is O(nlogn). Collecting all the results together, 
we can find the Sterling-optimal strategies making zero, one or more than one trade in O(nlogn) 
time, completing the proof of the first part of Theorem 11.31 

Note that by only maintaining exit points with weight at most some given constant, MDDq, i.e., 
by truncating some region of the points to the right, this algorithm is easily extended to computing 
the Sterling-optimal strategy that uses exactly one trade and has an MDD < MDDq. 

Proposition 3.14 Given MDDq, a Sterling- optimal strategy that uses exactly one trade and has 
MDD < MDDq can be computed in 0{n log n) time. 

This result will be useful when we consider constrained Sterling-optimal strategies. 
3.4 Constrained Sterling-Optimal Strategies 

As with the return-optimal strategies, the unconstrained sterling-optimal strategies may make too 
many trades. Here, we consider the the Sterling-optimal strategy that makes at most K trades, 
^stri - We- refer to such strategies as X-Sterling-optimal. First, we present some properties of this 
strategy, before giving an efficient algorithm to compute it. 

A maximal return-optimal strategy T* is a return-optimal strategy whose trade intervals cannot 
be enlarged. Given any return-optimal strategy 7^, in one (linear time) scan from left to right, 
we can enlarge any trade intervals maximally to the right as long as they keep the same return. 
Similarily, in one backward scan, we can extend all trade intervals maximally to the left. Since 7^ 
can be computed in linear time, we conclude that 

Lemma 3.15 A maximal return- optimal strategy 7^* can be computed in linear time. 

If any trade interval of a maximal return-optimal strategy is extended in either direction, then the 
total return must strictly decrease. In the previous section, we gave an O(nlogn) algorithm for 
computing the Sterling-optimal strategy with exactly 1 trade. We also saw that if the unconstrained 
Sterling-optimal strategy contains more than 1 trade, then it is T*. Fix K, and let the number of 
trades that T* makes be Kq < K. In this case T^ r \ = 7T, and we are done. Thus we only need 
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to consider the case that 1 < K < Kq. Some important properties of T* are summarized below. 
When it is clear, We also use 7^* to refer to the set of trading intervals {irjyj^. Let Cj = Y^j=\ % 
denote the cumulative return sequence of the excess returns. 

Lemma 3.16 Let 7^* be maximal return-optimal. Let I be an interval \pi,tj\. 

i. Ifle T;, then, Yi k=i h -fs P >0 and MDD(I) < f sp . 

ii. Suppose I does not intersect with any interval in T* and let the return of I (Y^ k=i Sk) be //(/). 
Then, n(I) < f sp . If I is adjacent to some interval of , then < 0. If I is adjacent to 
two intervals in T* , then n(I) < —f sp . 

Hi. Let [ti,t r ] and [ti',t r /] be two consecutive trade intervals in T* I < r < V < r' . Then, 
C r — Cy > f sp , and for all r < q < V , C\> < C q < C r . 

Let {(ai,bi)} denote the local minima and maxima of {Cj}, as in the previous section. Any trade 
of or 75^1 must enter (exit) at a local minimum (maximum). Further, the entry (exit) point 
must be a minimum (maximum) in the trade, otherwise we can shrink the trade, strictly increasing 
the return without increasing the MDD. 

Lemma 3.17 Let I = [t;,t r ] be a trade interval of or T^ r \- Then Ci is a local minimum, C r is 
a local maximum, and for any k, with I < k < r , C\ < < C r 

We now give an important inclusion property of the strategy Essentially, it states that the 

Sterling optimal strategy can be constructed by merging some trades of the return optimal strategy. 
We give the technical proof in the appendix (which will be the case with most of the technical results 
in this section). 

Proposition 3.18 LetT* be a maximal return- optimal trading strategy. There exists a K -Sterling- 
optimal strategy T^ r \, K > 1, with the following property: if I = [ti,t r ] is any trading interval in 
7c^r|, then a prefix of I and a suffix of I are trades in the maximal return- optimal strategy T*. 

As a result of Proposition 13.181 we assume from now on that every interval of the sterling optimal 
strategy T$£ r \ is prefixed and suffixed by (not necessarily distinct) intervals from a maximal return- 
optimal strategy that makes Kq trades. 

Lemma 3.19 If 1 < K < K then T£ r] can be chosen to make exactly K trades. 

Lemmas 13.181 and 13.191 indicate how T$£ r \ can be constructed: start with all the intervals of a maxi- 
mal return-optimal strategy T* and then merge some neighbouring intervals, keeping the merging 
sequence that gave the best strategy. The number of possible merging sequences is exponential, 
however, we will now show that an efficient greedy merging algorithm gives the correct result. 
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Given two consecutive non-adjacent intervals I\ = [^ 1 ,t ri ], I2 = [ti 2 ,t r2 ], where I\ preceeds 
I2, define the bridge B{I\,l2) = [t ri ,ti 2 ] to be interval connecting I\ with I2. If I\ and I2 are 
intervals in a maximal return optimal strategy, then by Lemma 13.161 the MDD of the bridge 
is C ri — Ci 2 . Since C n is a maximum over the interval [t^,^], and C[ 2 is a minimum over the 
interval [t ri ,t r2 ], we have that the MDD of the union of these three intervals, [i^,^] is given by 
max{MDD(h), C n - C h ,MDD{h)}. 

For every bridge B{I\, I2), define the closure Cl(B(Ii, I2)) to be the smallest interval J = [ti, t r ], 
in the return sequence, satisfying the following three properties. 

Cl\. Ci < C m < C r for I < m < r, i.e., Ci is a minimum and C r is a maximum in [^,t r ]. 
Cl2- I\,l2 C J, i.e., J contains both I\ and I2. 

CI3. J is prefixed and suffixed by intervals from a maximal return-optimal strategy T*. 

Note that a bridge may not have closure. For example, if the two last intervals Ii-i,Ii in T* are 
such that such that the end point l\ is below the end point of Ii—i, then B(Ii-\,Ii) doesn't have a 
closure. The next lemma shows that if the closure J for a bridge exists, then not only is it unique, 
but any other interval satisfying Cl\ - CI3 contains J. 

Lemma 3.20 For two intervals I\,l2, if Cl(B(I\, I2)) exists, then it is unique. Moreover, for any 
other interval I satisfying Cl\ - CI3, Cl(B{I\, I2)) C I. 

Proof: Let J\ = [tj^trj and J2 = \ti 2 ,t r2 \ satisfy Cl\ - CZ3. Without loss of generality, assume 
that ti x < ti 2 < t n < t r2 . By construction, JiflJ2 = [ti 2 ,t ri ] satisfies Cl\ - CI3. Now let Cl(B{I\, I2)) 
be the intersection of all intervals that satisfy Cl\ - CI3, concluding the proof. ■ 

Suppose that bridge B and B' are bridges in 7^* and that Cl{B) contains B' . Then Cl{B) satisfies 
Cl\ - CI3 with respect to B' and hence Cl{B) also contains Cl(B'). 

Lemma 3.21 Let B and B' be bridges in T* . If B' C Cl(B), then Cl(B') C Cl(B). 

Any interval in 7~£ r \ containing bridge B satisfies properties Cl\ - CI3 (Lemma 13.171 &; Proposition 
I3.18p . immediately yielding the following proposition. 

Proposition 3.22 Let I £ and let B be a bridge in 7^*. 

1. IfBc I, then Cl(B) C I. 

ii. If B does not have a closure, then no K- Sterling- optimal strategy can contain B. 

Hi. A K- Sterling- optimal strategy with more than one trading interval and no bridges of 7^* has 
MDD = f sp . If it contains one or more bridges B{ ofT^ , then MDD = maxj MDD{Cl{Bi)). 
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iv. The MDD of a K- Sterling- optimal strategy with more than one trading interval can be one of 
at most T + 1 possible values where T is the number of bridges between the intervals ofT*. 

Proof: (i) and (ii) are immediate, (iv) follows from (iii), thus we only need to prove (iii). Let 
/ € 7g^| contain the consecutive bridges B%, . . . , Bx, and hence their closures. From (i), it is 
clear that MDD(I) > maxj MDD(Cl(Bi)). It is also clear that I = ufCZ(Sj). We show, by 
strong induction on K, a more general statement than we need: suppose that I = ufCl(Bi), then 
MDD(I) < m&XiMDD(Cl(Bi)). If K = 1 then / = Cl(Bi) and the result is trivial; suppose 
it is true for up to K — 1 consecutive bridges, K > 1, and suppose that I is the union of K 
closures of consecutive bridges. Consider the first closure Cl(B\). Let / = [^,i r ] and Cl(Bi) = 
[ti,t r i], t r i < t r . By definition of Cl(Bi), C T > is a maximum over [tj,t r /]. Thus, MDD(I) = 
m.ax{MDD(Cl(B 1 )),MDD([t r > i t r ])}. If r = r' , then I = CZ(Bi) and we are done. If r < r' , 
then V+l is the begining of some bridge B K . Let I 1 = \J?L K Cl{Bi). Then, [V,i r ] C V and so 
MDD([t r /,t r ]) < MDD(I'). But /' is the union of at most K — 1 closures, so by the induction 
hypothesis, MDD(I') < maxj> K MDD(Cl(Bi)), concluding the proof. ■ 

We will distinguish between four types of bridges. Let I\ = [fy 15 £ ri ], I2 = [ti 2 ,t r2 ] be consecutive 
intervals in 7^*. The bridge B = B(Ii, I2) can be one of four types: 

regular. C\ x < C\ 2 and C ri < C r2 , i.e., Cl(B) = [h,r2]. 

right irregular. < Ci 2 and C ri > C T2 , i.e., Cl(B) contains the next bridge. 

left irregular. > Ci 2 and C ri < C r2 , i.e., Cl(B) contains the previous bridge. 

irregular. C n > C T2 and > Ci 2 , i.e., Cl(B) contains both the next and previous bridges. 

We define the weight of the bridge W(B(Ii, I2)) as follows: 



W(B(h,I 2 )) 



C ri - Ci 2 if B(Ii,I 2 ) is regular, 

C ri — Ci 2 if B(I\,l2) is left irregular and the previous bridge is right irregular. 

Cn — C 2 if B(Ii,l2) is left irregular and the previous bridge is irregular. 

+00 otherwise. 



The general idea behind our algorithm is to start with a maximal return-optimal strategy and 
greedily merge pairs of intervals or pair of bridges according to the bridge weight, keeping track 
of the best K intervals each time. When no more merging can occur, because we are down to K 
intervals or all the bridge weights are 00, we return the best K intervals we have seen so far. More 
precisely, let T£ = {h, • • • 1 Ik } be a maximal return-optimal trading strategy making Kq trades. 
We denote this pool of trade intervals by Pq, the base pool. From pool Pi, we obtain pool Pj+i by a 
single merge according to the following rule. Let B = B{I\,l2) be the bridge with smallest weight. 
If B = 00, stop (pool Pi + i does not exist). Otherwise, there are two cases. 
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i. Regular merge: if B is regular, merge B with I± and I2 to get a larger interval I new = [i/^i^]. 
We now update the status (type and weight) of any neighboring bridges as follows: 

• Previous bridge changes from right-irregular to regular (update type and weight). 

• Previous bridge B' changes irregular to left-irregular (update type). If the bridge previous 
to B' is right-irregular or irregular then update weight. 

• Next bridge changes from irregular to right-irregular (update type). 

• Next bridge changes from left-irregular to to regular (update type and weight). 

ii. Irregular merge: if B is left irregular, denoting the previous bridge B* , merge the two bridges 
and the interval between them into one bigger bridge B new = B*\Jl\\JB. The status of bridges 
other than B have not changed. The status and weight of B may need updating. 

Intervals are formed only by regular merges, so it is easy to see that intervals resulting from this 
merging procedure begin at a minimum and end at a maximum. New bridges are formed by 
irregular merges, and the resulting bridge must begin at a maximum and end at a minimum. The 
only bridge weights that could change are those that had weights of 00. In such an event the weight 
will drop, but not below the weight of the original bridge used in the merge that led to the update. 

Lemma 3.23 Let bridge B with weight w be involved in a merge, and suppose that the weight of 
bridge B' is updated in the process from 00 to u. Then, w < u. 

Proof: There are two cases: 

i. The bridge involved in the merge was regular, i.e., two consecutive intervals 1\ = [i^,^] 
and I2 = [ti 2 ,t r2 ] are merged with their bridge B12 = B(Ii,l2) with W(B\2) = w. Let the 
preceeding interval be Jo = [iz ,tr ] an d the following interval be ^3 = [tj 3 ,ir 3 ] , and let the 
preceeding and following bridges be Bq\ and B23 respectively. If Bqi obtained finite weight, it 
cannot be left irregular, as it would remain left irregular after the merge, and hence its weight 
would still be 00. Thus, we need only consider Bq\ right irregular or irregular (i.e., C ro > C ri ). 
Its weight becomes u = C ro — C[ L > C ri — Ci x . Since B12 is regular, w = C ri — Ci 2 < C ri — C/ 1 
and so w < u. If B23 obtained finite weight, then it could not be right regular or irregular as 
it could not become regular or left irregular after the merge. Thus, we need only consider B23 
left irregular (C/ 2 > Q 3 ). Its weight becomes u = C r2 — C\ 3 > C r - 2 — C[ 2 . Since B\2 is regular, 
C ri < C r2 , and so u > C ri — C\ 2 = w. 

ii. The bridge involved in the merge was left-irregular, i.e., B\2 = [t ri ,ti 2 ] is left irregular, and 
-E>oi = [tr >*Zi] i s either right-irregular or irregular (in both cases, C ro > C n ). Let w = C ri —C[ 2 
be the weight of Byi- The merged bridge is B = B01I1B12. If B has finite weight (i.e. it is 
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either regular or left-irregular), then its new weight is u = C ro — C\ 2 > C ri — C\ 2 = w. If 
B is left-irregular or irregular, then it does not change the weight of any other bridge. If, 
on the other hand, B became right-irregular or irregular, then it could affect the weight of 
the following bridge P23, if P23 was left-irregular (Ci 2 > C/ 3 ). In this case, the weight of 
B23 = [t r2 ^i 3 ] becomes v = C r2 — Q 3 > C r2 — C\ 2 . But since B\2 was left- irregular, C r2 > C n , 
and so v > C ri — C\ 2 = w. 

■ 

The next lemma shows that if all the bridge weights become 00, any further merged pairs of intervals 
can never be part of a i^-Sterling-optimal strategy. 

Lemma 3.24 If all the bridge weights in a pool of intervals are 00, then any further merged pairs 
of intervals from this pool can never be part of a K -Sterling- optimal strategy. 

Proof: (Lemma l3.24p . Let P r be pool obtained from Pq by some sequence of merging intervals with 
bridges of finite weight, and suppose that all the bridges in P r have infinite weight. In particular, 
this means that none of the bridges in P r are regular. Denote the bridges by B±, . . . ,B m , and 
consider bridge Bk- If Bk is right irregular or irregular, then all following bridges are either right 
irregular or irregular since all bridges have finite weight. If a trading interval contains Bk, it must 
contain Bk+i (since Bk is right irregular or irregular), and so by induction, it must contain all the 
following bridges (and their closures). But, the last bridge does not have a closure (as it is right 
irregular or irregular), a contradiction. If on the other hand, Bk is left irregular, then all preceeding 
bridges are left irregular as all bridges have infinite weight. If a trading interval contains Bk, it must 
contain Bk-\ (since Bk is left irregular), and so by induction, it must contain all the preceeding 
bridges (and their closures). But, the first bridge does not have a closure (as it is left irregular), a 
contradiction. We conclude that Bk cannot be in any trading interval. ■ 

Each merge decreases the number of intervals and number of bridges by one. If we merge down 
to pool Pk -k, we are left with exactly K intervals. We will show that T$£ r \ can be chosen to be 
the best K trades (with respect to total return) in one of these pools. Specifically, define Tj K to 
be the K intervals in Pj with the highest total return. We say that a strategy is coarser than pool 
Pj if the strategy can be obtained by a sequence of merges of some (or all) of the intervals in Pj. 
Clearly, Vi, P«+i is coarser than Pj, as Pj+i is obtained from Pj after a single merge. Note that for 
a strategy to be coarser than Pj, it need not contain every trade in Pj, however if it contains part 
of any trade in Pj, then it contains the entire trade. Next, we show that after a merge, the MDD 
of the remaining intervals is equal to the weight of the bridge involved in the merging. 
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Lemma 3.25 If pool Pi, i>l, was obtained from Pj_i by a merge involving a bridge of weight w, 
then the MDD of any interval in Pi is at most w. If the merge created a new interval (i.e., the 
bridge was regular), then the MDD of the new interval is equal to w. 

Proof: In pool Po, since any bridge is adjacent to two intervals of T*, its weight is at least f sp 
(Lemma 13. 16j) . Consider sequence of pools Pq, P\, . . . , P r , where bridge Bi with weight W(E>i) was 
the minimum weight bridge involved in the merge that resulted in pool Pi from from Pj_i. By 
Lemma T3.23I bridges weights are non-decreasing, i.e., W(B>i) < W{Bi + \). 

We now use induction on the index i. For i = 1, from Lemma 13.161 every interval in Po has 
MDD at most f sp . If Pi was obtained from Po by an irregular merge, then all intervals of P\ are 
intervals of Po, with MDD at most f sp . Since W{B\) > f sp , the claim holds. If the merge was 
regular, then the MDD is W{B\) > f sp and the MDD of all other intervals is at most f sp . Thus, 
the claim holds for P\. 

Suppose the claim holds for all j < i and consider pool Pi which was obtained from Pj_i using 
a merge involving Pj. By the induction hypethesis, the MDD of any interval from Pj_-i is at 
most W(Bi-i) < W{Bi). If Pi that was obtained by an irregular merge, every interval of Pi is an 
interval of p_i and thus has MDD at most VF(Pj_i) < W{Bi). Suppose that Pj was obtained by 
a regular merge - all intervals except the merged interval are intervals of p_i. Consider the MDD 
of the new interval, which is obtained by the regular merge I\ U Bi U 12- Since new intervals are 
created only through regular merges, it is easy to see by induction that property Cl\ holds for all 
the intervals in Pj_i, in particular it holds for I\ and 12- Since Bi was regular, the MDD of the new 
interval is max(MPP(Ii), W(B,i), MDD(I 2 )). By the induction hypothesis, MDD(h) < W{B^x) 
and MDD(I 2 ) < W(Pj-i), thus, max.{MDD{h), W(Bi),MDD(I 2 )) = W(Bi). m 

First, we show that if a i^-Sterling-optimal strategy makes K trades, all of which are contained in 
intervals of one of the pools Pj, then a i^-Sterling-optimal strategy exists which is composed of the 
K intervals with highest return in some pool Pj with j < i. 

Lemma 3.26 If K subintervals of the intervals of pool Pi are a K- Sterling- optimal strategy, then 
for some j < i, the K intervals with highest return of pool Pj are a K- Sterling- optimal strategy. 

Proof: If Pj = Po, then the claim is trivial. Suppose that i > 0, and let T = {p, . . . , Ik} be the 
A'-Sterling-optimal strategy whose trades are all subintervals of intervals in Pj. Consider the set 
B of all bridges in 7^* that are contained in T, B = {Pj}[ =1 . We can assume that B is not empty 
because if it were, then T is composed of intervals in 7^, in which case the top K intervals (with 
respect to return) in are clearly optimal. Since Pj contains all the intervals in T, Pj contains 
all the bridges in B. Thus, there must exist j < i such that Pj contains all the bridges in B and 
no pool Pfc, with k < j has this property, i.e., Pj was obtained from the previous pool by a regular 
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merge involving a bridge B* which must contain some bridge B[ E B. Let / be the interval in T 
that contains B\. Then, I must contain the whole bridge B*, since if B* is the result of irregular 
merges, one of which involved bridge Bi, then B* C Cl(Bi), and Cl(Bi) C I (Proposition 13.221) . 
Since B C I, MDD(T) > MDD(I) > W(B*). By Lemma [3J251 since B* was the last bridge 
involved in a merge, the MDD of every interval in Pj is at most W(B*). Since every interval of 
T is a subinterval of some interval in Pj, we conclude that T is contained in at most K intervals 
of Pj. Let Tj be the top K intervals in Pj. Then, the return of T is at most the return of . 
Further, MDD{T 3 K ) < W(B*) < MDD(T), and so Strl(7~ A ') > Strl(T), concluding the proof. ■ 

We are now ready to prove the main result, which will lead to the greedy algorithm for constructing 
a iT-Sterling optimal strategy. 

Theorem 3.27 Let j* be such that Strl(7^f) > Strl(7J x ), Vj. Then is K-Sterling optimal. 

Proof: Let Sq be a K-Sterling-optimal strategy that makes K trades - by Lemma 13.191 such a 
strategy must exist. If Sq has the same Sterling ratio as the trading strategy composed of the K 
most profitable trades in Pq, then we are done. If not, then we know from Proposition 13.181 that 
Sq is coarser than Pq. We prove the following statement for all k > 1 

Q(k): Suppose there exists a i^-Sterling-optimal strategy S k K _ 1 that makes K trades 
and is coarser than Pk-i- Then either SP-^ is composed of K intervals of Pk, or there 
exists a i^-Sterling-optimal strategy Sj? that makes K trades and is coarser than Pk. 

We know that Q(l) is true. Suppose that Q(k) is true for all k > 1, we then prove the proposition 
as follows. By an easy induction, we have that if none of the Sj ( _ 1 are composed of K intervals in 
Pj for all j < m, then there is a ^-Sterling-optimal strategy making exactly K trades that is 
coarser than P m . Suppose that we can construct a total of n + 1 pools, Pi for < i < n < K$ — K. 
If k < Kq — K then all the bridge weights in P K are infinite. If k = Kq — K, then any further 
merging leads to fewer than K intervals. In both cases, there cannot exist a X-Sterling-optimal 
strategy that is coarser than P K . Therefore, for some j* < k, the fC-Sterling-optimal strategy Sji_ ± 
is composed of K intervals of Pj*. By Lemma l3.26l there is a i^-Sterling-optimal strategy that 
is composed of the top K intervals of some pool Pi, where I < j*. 

What remains is to show that Q(k) is true for all k > 1. Suppose that S k K _ 1 is coarser than 
Pk-i and is not composed of K intervals in P^. We show that there exists S¥ that is coarser than 
Pk- Since is coarser than Pk-i, it contains at least one bridge B in Pk-i with finite weight 
(because if it contains an infinite weight bridge, then it either contains the preceeding or following 
bridge; this argument continues analogously to the proof of Lemma 13.241 until we include a bridge 
of finite weight). Let / be the interval of S^_ l that contains B, and let l\ and I r be intervals in 
(which are subintervals of /) connected by B. Let B* be the bridge in Pk-i with minimum 
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weight that was involved in the merge to get Pj. from P^—i, and let /* and I* be the intervals in 
Pfc_i connected by B* . If B* = B then S'£^_ 1 is also coarser than P& and we are done, so suppose 
B* ^ B. There are two possibilities: 

(i) B* is a regular bridge. If does not contain If or /*, then Pi (If U P* U /*) =0 and 
thus Sj?_ l itself is coarser than P&, and can be chosen as Sj? . Suppose that S^_ ± contains 
/* and not /* (similar argument if it contains I* and not If). Thus some interval I' € Sf?_ 1 
has as a suffix If. Suppose we construct Si? by replacing interval I' by interval I' U B* U I*. 
Sj? is then coarser than P^. Since B* is regular, the return of I' U B* U /* is at least 
as big as the return of I'. /* is either an interval of Pq ° r was obtained by merging some 
intervals of Po through bridges with weight at most W(B*) (Lemma l3.25|) . and so MDD(I*) < 
W(B*). Since the maximum cumulative return for I' is attained at its right endpoint (Lemma 
I3.17P and the left endpoint of /* is a minimum in I*, we have that MDD[I' U B* U /*) = 
max{AIDD(I'),W{B*),MDD(I*)} = m&x{MDD(I'),W{B*)}. Since W(B*) < W(B), we 
conclude that MDD(Sff) < MDD(Sj}_ 1 ), and thus Strl(Sf ) > Strl(S* J, which means that 
Sjf is also K-Sterling-Optimal. Finally, suppose that S^_ x contains both If and /*, and 
consider the strategy Sff obtained from Sj^__ 1 by removing bridge B and adding bridge B*. 
M (5^) = + W{B) - W(B*) > fi(Sk-i)- Since W{B) > W(B*), the MDD cannot 
have increased, and so S? is K-Ster ling-Optimal and coarser than P^. 

(ii) B* is an irregular bridge. Since B* = B(If,I*) has finite weight, we can conclude that B* is 
left-irregular and the previous bridge P_ = B(If_ 1 , If) is right-irregular or irregular. Since 

does not contain B* , by Lemma f3. 171 there are two possibilities: S^_ x does not contain 
If, in which case it also does not contain bridge P_ and so P_ and B* can be merged into one 
bridge without influencing Sf?__ v i.e., S^_ x is also more coarse than P&; or, contains If 
as one of its intervals. In this case, since B* is left-irregular, fi(If) < W{B*) < W(B), and so 
by dropping If from Sf?_ l and breaking I into two subintervals by removing B from / results 
in a profit increase of W(B) — fJ,(If) > 0. Further, the MDD cannot increase, so the new 
strategy makes K trades and has strictly greater Sterling ratio than Sffi, which contradicts 
optimality of S¥_i. Thus, Su_i cannot contain If as one of its intervals. 

Thus Q(k) holds for all k > 1, concluding the proof. ■ 

We are now ready to give the O(nlogn) algorithm that establishes Theorem 11.31 First, we can 
compute the optimal strategy that makes only one trade in O(nlogn) (Section I3.3|) . and compare 
this with the trivial strategy that makes no trades. It remains to compute the fT-Sterling-optimal 
strategy and pick the best. We show how to do this in 0(n log n) time. 
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First we obtain T* in linear time. Suppose T* makes Kq > K trades (as otherwise T* is our 
solution). By Theorem 13.271 we only need to construct the pools Pq,P\, . . ., maintaining the pool 
with the optimal Sterling ratio for its top K trades, as explained in the following algorithm. 

1: Set i = 0; Sort (in decreasing order) the intervals from Pq according to profit; Sort all the 
bridges with finite weight in increasing order. Let Bi be the minimum weight bridge in Pf, Let 
strategy Si consist of the top K intervals, and let Strl op t = Strl(S'j); 

2: while Pi contains at least K intervals and at least one finite weight bridge do 

3: if Bi = B(Ii,I r ) is regular then 

4: Regular merge to obtain -Pj+i: remove Ii,I r , Bi from the interval and bridge orderings, and 

add back / = /; U B; U I r into the interval ordering; compute /i(I) and MDD(I); 
5: Update neighboring bridge weigths and re-insert them back into the bridge ordering. 
6: else if Bi = B(Ii,I r ) is left-regular then 

7: Irregular merge to obtain Pj+i: Let B_ be the bridge left of Bi; remove Ii,B_, Bi from the 
interval and bridge orderings. Create the new bridge B = U Ii U Bi, compute W(B) 
and insert B into the bridge ordering (note that W(B) may be oo). 

8: end if 

9: i <- i + 1; update Strlj; if Strl opt < Strlj, then Strl opt Strlj. 
10: end while 

The correctness of the algorithm follows from Theorem 13.271 We now analyse the run time of 
an efficient implementation of the algorithm. Pq contains at most n intervals and bridges. Each 
execution of the while loop reduces loop number of bridges and intervals by 1 each, so the while 
loop is executed at most n/2 times. Merging two intervals is a constant time operation. The profit 
of a new interval is the profit of the merged intervals minus the weight of the merging bridge (also 
computable in constant time). The MDD of a new interval is the maximum of the MDD of the 
merged intervals and the weight of the merging bridge (also computable in constant time). The 
weight of a new bridge takes constant time to compute, and updating the weights of the neighbour 
bridges is a constant time operation provided that pointers are maintained to them. These pointers 
can be updated in constant time as well. Thus the run time within the while loop is dominated by 
inserting into the bridge or interval orderings. At most a constant number of such such inserts into 
a sorted list need to be done, and each is an O(logn) operation [BJ. To efficiently implement step 
9, we maintain two sorted lists of the top K intervals in the algorithm, sorted according to return 
and MDD. These can be maintained in O(logA') operations. The first allows us to update the 
total return of the top K intervals in constant time, and the second allows us to update the MDD 
of the top K intervals (by taking the interval with largest MDD) in constant time. Thus the total 
running time of the while loop is 0(nlogn + n log K) = 0(n\ogn) The preprocessing (step 1) is 
0(n), and so does not contribute to the asymptotic run time. 
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4 Sharpe Optimal Trading Strategies 



Another popular measure of the portfolio's risk-adjusted return is the Sharp Ratio. For trading 
strategy T, we consider two versions of the Sharpe ratio, Shrp 1 and Shrp 2 . 

Shr Pl (T) = ^j, Shrp 2 (T) = ^. (9) 

Note that Shrp 2 is more conservative in that it penalizes large variances more heavily. We introduce 
a simplified Sharpe ratio (SSR) S that will be instrumental to finding the optimal strategies, 

S = ^ 

2 

It is easy to check that maximizing Shrp 1 is equivalent to maximizing and that Shrp 2 is given 
by i I _ 2 , where f is the mean return. We will relate the maximization of Shrpj^ and Shrp 2 to the 
maximization of S. 

Let T be a trading strategy that makes K trades, with trading intervals I\, . . . , Ik- Each trade 
contributes a transactions cost of — f sp to the return sequence. In general, a trade contributes 
fs Jb t° s 2 - However, we will assume that f sp <C 1 and so we will ignore the contribution of the 
transactions cost to s 2 . Alternatively, we can justify this by assuming that the transactions cost is 
spread finely over many time intervals. The sum over these small time intervals is finite, equal to 
—fsp, however, the sum of squares over these small time intervals can be made arbitrarily small. 
Define the total return and sum of squared returns for each trading interval, 

!H = m = E^i< s * = s2 (^) = E^f- 

jeii jeii 

We define Ai as the contribution of trade i to the mean return, and Bi as the contribution of trade 
i to the mean squared return (ignoring the effect of the transactions cost), i.e., Ai = -(fii — f sp ) 
and Bi = ±s 2 . We define A(T) = J2k=i A * ( note that f = ^C 7 ")) and B 0~) = EfeLi B i (note that 

4.1 Maximizing the Simplified Sharpe Ratio S 

We will need the following technical lemma, which can be proved by an easy induction. 

Lemma 4.1 Let J r = {^,||,...,^-} be any set of fractions satisfying bi > and § < < f , 
for all i, where b,d > 0. Then, ^ < ^^^'"^ < f . The upper (resp. lower) bound is strict if at 
least one of the fractions in T is strictly upper (resp. lower) bounded by | [resp |) . 
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Let T* be an SSR-optimal strategy making K > 1 trades with trading intervals I\, . . . , Ik- 

— Si=i^ _ A(T*) 



EliBi B(T*} 



Lemma 4.2 ^ is a constant for every interval i, i.e., every trade is equivalent. 

Proof: Suppose that min^ 4 1 < 4^- for some j (strict inequality), and without loss of generality, 

% 3 

assume that the minimum is attained for interval I\. By Lemma |4. H if we remove 1%, we get that 

S(/ 1 u-uJ jr ) = ^^<^^ = S(J 2 u-uJ J r), 

A A- 

which contradicts the optimality of T* implying that muij = -g- for all j. ■ 

Corollary 4.3 An SSR-optimal trading strategy making one trade exists. 

Proposition 4.4 An SSR-optimal strategy making one trade can be found in O(nlogn) time. 

Proof: By Corollary I4.3[ we are guaranteed the existence of such a strategy. It suffices to find 
the single interval / maximizing ^2 i€ j r[i]/ Yliei r [*] 2 - Consider all intervals starting at position i 
and define Cfc = X^=i r b1 an d dk = Sj=i r b] 2 - We wish to find k to maximize Ck/d^. If we have 
done this for position i, we now consider position i — 1. We show that the algorithm in Section 
13.3.11 can be used. Trade intervals starting at i — 1 correspond to shifting all the Ck by r[i — 1], 
and all the dk by r[i — l] 2 . Both these operations simply correspond to shifting the origin point p 
to p' = p — (r[i — l],r[i — l] 2 ). We then add a new leftmost point at p. Since each update takes 
O(logn), and the optimal interval for the new points can be found in O(logn), the entire algorithm 
runs in O(nlogn). ■ 

4.2 Maximizing Shrp 2 

Ignoring the f sp 2 term in the denominator changes the denominator slightly, so we introduce the 



slightly different quantity Shrp 2 . Specifically, 

Shr P2 (T) = . 2 tZ] ,o,^ » Sh7p- 2 (T) 



if sp 2 + B{T) - A 2 (T) ' ™" B(T) - A 2 (T) ' 

where d is the number of trades in T ■ By the Cauchy-Schwarz inequality, for any trading strategy, 
^r[i] 2 > ^(^i r [*]) 2 - Since we are only interested in strategies for which -A(T") > 0, we have 
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Lemma 4.5 For strategy T, if A(T) > then B(T) - A 2 (T) > 



We will show that maximizing Shrp 2 is closely related to a constrained optimization of the SSR, 
and that maximizing Shrp 2 is not too far from maximizing Shrp 2 . 

Let be return optimal, with return f^(72) = A**- For any < a < fj,*, we define the 
constrained SSR-optimal strategy Ta as the strategy with maximum SSR among all strategies with 
return at least a, i.e., A(T a ) > a and for all strategies T with A(T > a), S(T a ) > S(T). Note that 
while an SSR-optimal strategy can be chosen with one trading interval, a constrained SSR-optimal 
strategy may require more than one trading interval. We show that for some appropriate threshold 
a, the constrained SSR-optimal strategy is a Shrp 2 -optimal strategy. 

Proposition 4.6 30 < a < /i* such that the constrained SSR-optimal strategy T a is S\r\rp 2 -optimal. 

Proof: Let T be any Shrp 2 -optimal strategy, and let a* = A(T). Let T a * be any constrained 
SSR-optimal strategy. Then A(T a *) > A(T) and since S(7^») > S(7~), we have that 

< A(fa)B(T) - A(T)B(Tcr). 

Suppose that Shrp 2 (7^*) < Shrp 2 (T), then 

< A(T a *)B(T) - A(T)B(T a *) < A(T a *)A(T) ■ (A(T) - A(T a *))- 

Both A(T a *) and A{T) are > 0, otherwise both strategies are inferior to 7^*; thus A(T) > A(T a *), 
which is a contradiction. Therefore Shrp 2 (7^») > Shrp 2 (7") and so T a * is Shrp 2 -optimal. ■ 

We will need the following technical property of any SSR-optimal interval (see appendix for proof). 

Proposition 4.7 Let J be a subinterval of an SSR-optimal interval I. Then, [J,(J) > —f sp - 
Further, if J is a prefix or suffix of I, then fi(J) > 0. 

The next result establishes the intuitive result that adding an SSR-optimal interval to any trading 
strategy can only improve the strategy. 

Proposition 4.8 Let Lq be any SSR-optimal interval, and let T be any trading strategy. Let 
T' = J U T. Then A(T') > A(T) and S(T') > S(T) 

We can now give the intuition behind our algorithm. The starting point is Proposition 14.61 which 
says that it suffices to look for constrained SSR-optimal strategies. So the natural first choice is 
an unconstrained SSR-optimal interval To- Either this will be Shrp 2 optimal or not. If not, it is 
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because it has too small a return. So our next step is to add to this interval a new interval (possibly 
adjacent) with the property that the interval increases the return with smallest possible decrease in 
SSR, resulting in strategy Ti- We repeat this process, constructing a sequence of trading strategies 
7o,7i,... with the property that A{%) > A(Ti~i), and among all other strategies T such that 
A{T) > A(7i_i), S(7I) > S(7~). We then pick the strategy %* with maximum Shrp 2 ratio among 
these strategies, which will be globally sharpe optimal. 

Suppose that we have a current strategy, Ti- We need to determine the next piece to add to 
this so that we increase the return, with smallest possible decrease in SSR. Let % be composed of 
the intervals Iq,I\, . . . , We replace each of these intervals by a special symbol, $, to signify that 
these regions are already included in the strategy. We thus obtain a generalized returns sequence, 
one in which some intervals are replaced by the $ symbol. A generalized trading strategy on the 
generalized return sequence must be composed of trades that do not contain the $ symbol. However 
trades may be adjacent to the $ symbol. A trade interval / in a generalized trading strategy can be 
isolated (not adjacent to any $ symbol), extending (adjacent to one $ symbol), or bridging (adjacent 
to two $ symbols). In order to correctly account for the transactions cost, we need to change how 
we compute A(I), so we introduce the new function A(I): 



A(I) = 



A(I) I is isolated 

A(I) + ^ I is extending 
M 1 ) + % 1 is bridging 



The generalized simplified Sharp ratio ( GSSR) for generalized strategy T = {I±, . . . , Id} is 



S(T) 



Similar to the notion of a maximal return optimal strategy, we introduce the notion of a maxi- 
mal SSR-optimal (or GSSR-optimal) interval as one which cannot be extended in either direction 
without decreasing the SSR (or GSSR). 

We now define generalized return sequences {Ro, R\, ■ ■ .} as follows. Rq is just the original 
returns sequence. Let Ij be a maximal GSSR-optimal interval for We obtain the generalized 
sequence Ri+\ by replacing Ii C Ri with the symbol $. We define any set of generalized sequences 
obtained in this way as monotone. We also refer to a member of a monotone set as monotone. 
Let Ro, . . . , Rk be a monotone sequence of gerenalized returns sequences, and let Io,I\, ---,1k 
be the maximal GSSR-optimal intervals corresponding to each sequence. By construction, Ii is a 
maximal GSSR-optimal interval for R{. We have defined A so that the SSR of the union of these 
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intervals in Rq is given by 



S Ro (I UhU---Ul k )- , 

where the subscript Ri indicates on which generalized return sequence the quantity is computed. 
Lemma 4.9 S Ri (Ii) > S Ri+1 (I i+ i) 

Proof: Suppose that S Ri (Ii) < S Ri+1 {Ii+i), and let be the symbol that replaced Ii in Ri to 
obtain R4+1. If h+i is not adjacent with $j, then i, is not GSSR-optimal in Ri, a contradiction. If 
Ij+i is adjacent with $j, then Ii U /j+i has higher GSSR (by Lemma |4.1|) . so once again Ii is not 
GSSR-optimal in Ri. ■ 

Now an easy induction, using Lemmas 14.11 and 14.91 gives. 
Corollary 4.10 S R() (I U I x U ■ ■ ■ U I k ) > S Rk (I k ) for any k. 

Analogous to Propositions 14.4 1 14.71 B~8l we have the following three propositions. Their proofs are 
almost identical, so we omit them. 

Proposition 4.11 A GSSR-optimal strategy making one trade exists, and all maximal GSSR- 
optimal trades can be found in O(NlogN) time. 

Proposition 4.12 Let J be a subinterval of any GSSR-optimal interval I. Then /i(J) > —fsp- If 
J is a prefix or suffix of I that is not adjacent with the symbol "$", then /i(J) > 0. 

Proposition 4.13 Let Iq be any GSSR-optimal interval, and let T be any generalized trading 
strategy. Let V = h U T. Then, A(V) > A(T) and S(T') > S(T). 



We now give the main result that will lead to the final algorithm to obtain the Shrp 2 -optimal strat- 
egy. Its essential content is that given a monotone set of generalized returns sequences, Rq,R\, . . ., 



with corresponding GSSR-optimal intervals Iq,I\, ■ ■ ., for some k, T = Iq U Ii U • • • U I k is Shrp 2 
optimal. We will need some preliminary results. 



Proposition 4.14 For some k, T* = Iq U Ii U • • • U I k is Shrp 2 - optimal, where Ii are the GSSR- 
optimal intervals corresponding to a monotone set of generalized returns sequences. 



Proof: First we show that there exists a Shrp 2 -optimal strategy 7o that contains Iq. Indeed, 



let T be any Shrp 2 -optimal strategy, and consider 7o = To U T ■ By the Proposition 14.81 we have 
S(7o) > S(T) and A(%) > A(T) > 0. Then, 

A{%)A{T){A{%) - A(T)) + B{%)B{T){S{%) - S(T)) 



Shrp 2 (r ) - Shrp 2 (T) = (B(Tq) - A 2 (%)){B(T) - A 2 (T)) * °' 
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thus, 7o is Shrp 2 -optimal. 



Let 7fc be a Shrp 2 -optimal strategy that contains Jq U ■ ■ ■ U Ik- We know that Tq exists. If 
Tk = Iq U • • • U Ik, then we are done. If , Tk = Iq U • • • U U T', with T' 7^ 0, then we show that 



there must exist a Shrp 2 -optimal strategy Tk+i which contains Iq U • • • U Ik+i, he., there is some 
other T" 2 Ik+i such that 7fc+i = Iq U ■ ■ ■ U Ik U 7~" is Shrp 2 -optimal. The proposition then follows 
by an easy induction. 

Let T" = T'Ulk+i. Then, A Rk+1 (T") > A Rk+1 (T) and S Rk+1 (T") > S Rk+1 (T) (Proposition 
I4.13p . By Corollary 14.101 and the GSSR-optimality of Ik+i, we have that 

S(/ U . . . U 4) > S Rk+1 (I k+1 ) > S Rk+1 (T") > S Rk+1 (T') 

From now on, we will drop the R k +i subscript. Let A = A(Iq U . . . U Ik), B = B(Iq U . . . U Ik), 
A' = A(T'), B' = B(V), A" = A(T") and B" = B(T"). Let Sh7^ 2 = Sh7^ 2 (/ U ... U h), 
Shrp' 2 = Shrp 2 (/ U . . . U I k U T) and Shrp 2 = Shrp 2 (/ U . . . U I k U T"). Thus, 

A A + A' , A + A" 



ShrP2 = B~AV Shr P2 = B + B>-{A + A>r ^ = B + B»-{A + A»V 

Let A" = a" A and A' = a' A, where a" > a' > 0. Then, by direct computation, one obtains 



A , A 



Shr P2 = — 77, -< 7W> 7^71 and Shr P2 



B + A3 + T&&-B-A*) B + Al + ^i^rr-B-A*) 



Since Shrp 2 > Shrp 2 , we conclude that ^ — B — A 2 < 0. Since S(T") > S(T'), we have that 
> and since a" > a' > 0, > jf^ > 0, therefore 



a" (B" t2 \ a' ( B' 

B — A ) < I — — B — A \ < 0, 



1 + a" V a" / " 1 + a' \ a 



and so Shrp 2 > Shrp 2 , concluding the proof. 



By Proposition HTHl a Shrp 2 -optimal trading strategy can be obtained by constructing the strategies 
Tk, and then picking the one with the maximum value for Shrp 2 . The next proposition shows that 
this can be done in 0(n 2 logn) time. 



Proposition 4.15 A Sbrp 2 -optimal trading strategy can be found in time 0(n logn). 

Proof: Ii can be obtained in O(nlogn) time (Proposition 14. II]) . Since there are at most n such 
intervals (since each must be non-empty), obtaining all the intervals is in 0(n 2 logn). 

Given the intervals, a single scan can be used to obtain the k for which Tk is Shrp 2 -optimal. ■ 
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One can improve the runtime to 0(n 2 ) if 0(n 2 ) memory is available, however, we do not discuss 
the details. 



4.2.1 Approximation Ratio 



We have given an algorithm that obtains a Shrp 2 -optimal strategy. A modification to the algorithm 
constructs the hierarchy T% and pick the one with the highest value of Shrp 2 . Suppose we have 



a Shrp 2 -optimal strategy T and let T* be a Shrp 2 -optimal strategy. Then by Proposition 14. 6[ it 
must be that A(T*) > A(T) and that S(T*) < S(T). Since Shrp 2 (T) > Shrp 2 (T*), we have that 
A*(B - A 2 ) - A(B* - A* 2 ) < 0, where A = A(T),A* = A(T*),B = B(T),B* = B(T*). We can 
evaluate Shrp 2 (T*) — Shrp 2 (T) to obtain 



< Shrp 2 (T*) - Shrp 2 (T) < Shrp^ • 



d f 2 
-f sp 2 + B-A 2 



d t 2 



When B — A 2 = 0(1), which is usually the case, we see that this is a very accurate approximation 
(since f sp < 1). 

4.3 Maximizing Shrp 1 



Once again, we introduce the slightly different quantity Shrp l5 

Shr Pl (T)= ; A{T) ShTp-^T)- A{T) 



ifs P 2 + B(T)-A 2 (T) VB(T)-A 2 (T) 



We will optimize Shrp 1 (T). Since maximizing Shrp 1 (T) is equivalent to minimizing l/Shrp 1 (T) the 
problem reduces to maximizing 

A 2 (T) 



Q{T) 



B(T) 



The entire algorithm is analogous to that for maximizing Shrp 2 in the previous section, we only 
need to prove the analogs of Propositions 14.61 and 14.141 

Proposition 4.16 30 < a < fj,* such that the constrained SSR-optimal strategy T a is Q-optimal. 



Proof: Let T be Shrp^optimal, and let a* = A(T). Let T a * be a corresponding constrained 
SSR-optimal strategy. A(T a *) > A(T) and gffi^ > B^T) 1 Multiplying these two inequalities gives 
that ^jff^j > "bCt) ' ^ ,e " r ^ a * ^ s a ^ so Q-optimal. ■ 

Proposition 4.17 For some k, T* = JoUiiU- • -Ul^ is Q-optimal., where Ii are the GSSR-optimal 
intervals corresponding to a monotone set of generalized returns sequences. 
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Proof: The proof is very similar to tho proof of Proposition 14.141 Let T be Q-optimal, and let 
To = Iq U T. Then A{To) > A(7~) and S(7o) > S(T). Multiplying these two inequalities give that 
Q(7o) > Q(T), or that 7o is also Q-optimal. 

Let 7fc be a Q-optimal strategy that contains Ij U • • ■ U 4. Introduce T',T" = Ik+i U T' 
as in the proof of Proposition |4~14"1 Let Q = Q(I U . . . U 4), Q' = Q(I U . . . U I k U T') and 
Q" = Q(/ U...U4UT"), 



Q = -77, Q = 73 1 73/ ' and 



£? " B + B 1 " S + B" 

Following exactly the same logic as in the proof to Proposition 14,141 we only need to show that 
Q" > Q'. Let A" = a" A and A' = a' A, where a" > a' > 0. > ^ implies that f£ < and so 

Till Y>t 

— 'TrTTT) ■ °y direct computation, one obtains 



a"(a"+2) — 

A 2 A 2 

Q' = -, \ / „, Q" 



B + ( X ~ (l+a') 2 ) (a'(f'+2) ~ S j S + ( 1 ~ (<*"(f"+2) ~ B 

Since Q' > Q, it must be that ,, B , , „s — B < 0. Since a" > a' 1 — , 1 , n2 > 1 — 1 /Yl , so we 
have that 

,-,1 ^ i . „). (• . ' U . , n .„. 



[1 + a") 2 / V a 'K + 2 ) / V (1 + «') 2 / V"'( Q ' + 2 ) 
which implies that Q" > Q' . ■ 

4.3.1 Approximation Ratio 

Once again, a modification to the algorithm constructs the hierarchy % and picks the one with 



the highest one with value of Shrp^ Suppose we have a Shrp 1 -optimal strategy T and let T* be a 



Shrp^optimal strategy. By direct computation, and using the fact that Shrp 1 (T) > Shrp 1 (T*) 
A* 2 B - A 2 B* < 0, we get 



< Shrp 2 (T) - Shrp 2 (T) < Shrp 2 (T) 



which gives an approximation ratio of Jl — 0(f sp 2 ) when B = O(l). 
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5 Discusion 



Our main goal was to provide the theoretical basis for the computation of a posteriori optimal 
trading strategies, with respect to various criteria. In particular, we have presented the algorithms, 
together with the proofs of their correctness. The highlights of our contributions are that return and 
sterling optimal strategies can be computed very efficiently, even with constraints on the number 
of trades. Sharpe optimal strategies prove to be much tougher to compute. However, for slightly 
modified notions of the Sharpe ratio, where one ignores the impact of bid-ask spread squared we 
can compute the optimal strategy efficiently. This is a reasonable approach since in most cases, the 
bid-ask spread is ~ 10 -4 . We also show that this modified optimal strategy is not far from optimal 
with respect to the unmodified Sharpe ratio. 

Interesting topics of future research are to use these optimal strategies to learn how to trade 
optimally, which was the original motivation of this work. Further, one could use them to bench- 
mark trading strategies as well as markets. A natural open problem is whether Sharpe optimal 
strategies can be computed under constraints on the number of trades. We suspect that the mono- 
tone hierarchy we created with respect to the SSR has an important role to play, but the result has 
been elusive. 

The algorithms presented here can be useful if one could relax Assumption Al. We have used 
such an assumption because it considerably simplifies the analysis, and in many cases, the optimal 
strategy is in fact an all-or-nothing strategy. 

At the core of some of our algorithms is a new technique for optimizing quotients over intervals 
of a sequence. This technique is based on relating the problem to convex set operations, and for our 
purposes has direct application to optimizing the MDD, the simplified Sharpe ratio (SSR), which 
is an integral component in optimizing the Sharpe ratio, and the Downside Deviation Ratio (DDR). 
This technique may be of more general use in optimizing other financially important criteria. As a 
result, such an algorithm may be of independent interest. 
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A Technical Proofs 



Lemma 13.131 

Proof: Let A = 0(m) be the size of the current convex hull. For (1), we do not change the points at 
all, we simply compute the new tangent point for p' = p+v, which can be accomplished in 0(log A). 
(2) is equivalent to shifting p by —v. To prove (3), notice that if we remove (do,co), then the new 
leftmost point becomes (di, ci) and we immediately have the new convex hull nxt(l), nxt(nxt(l)), .... 
Thus we can find the new upper tangent point in 0(logA') = O(logm), where A' is the size of 
the new convex hull. Further, deleting (do,co) requires first deleting the backward pointers of the 
points that it points to 0(logA), and then deleting the point itself, and its forward pointers (it 
has no backward pointers), OilogA). To prove (4), note that when we add (d_i,c_i), nxt(— 1) 
is exactly the upper tangent point from p' = (c2_i,c_i) to the current convex hull. This can be 
computed in O(logvl). We now need to add all the necessary pointers into the data structure. For 
each forward pointer we add, we will add the coresponding backward pointer as well. We need a 
pointer at position 2 J in the convex hull of (d-i, c_i). But this is exactly the point at position 2 J — 1 
in the convex hull of point nxt(— 1). Since nxt(— 1) maintains a pointer to point 2 J in its convex 
hull, and this point will have a backward pointer by one step of this same convex hull, we can 
construct the forward an backward pointer for point 2 J in the convex hull of c_i) in constant 
time, requiring total time O(log^f) = O(logm) to construct all the new forward and backward 
pointers, where A' is the size of the new convex hull. We now construct the new upper tangent 
point from p to the new convex hull of (d-i, c_i) in 0(log A') time. The entire process is therefore 
O(logm). ■ 

The algorithm that we have just described is a general purpose algorithm for efficiently maintaining 
the upper tangent point to any set of points, as long as only a limited set of operations is allowed 
on the set of points and the source point. 

Proposition 13.181 

Proof: First we show that for every trading interval /* = [t a ,i{,] in 7^* with I D I* 7^ 0, one can 
pick 7g^| such that I* C I. Suppose to the contrary, that for some I*, either t a < ti and i& > ti or 
t a < t r and tb > t r . We will extend I without decreasing the Sterling ratio of T^ r \ so that I* C I. 
Suppose t a < ti and tb > t\ (a similar argument holds for t a < t r and > t r ). There are two cases: 

i. I* does not intersects any other interval of Applying Lemma f3.17l to J*, we have: C a <C\. 
Thus by extending / to [i j£r]) the return of the interval cannot decrease. Since MDD(I*) < 
f sp , this extension cannot increase the MDD(T^), since we already have that MDD(T^) > 

fsp- 
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ii. I* intersects with the previous trading interval of strategy Ti^: 3/' = [tii,t r >] 6 T£ r \ such that 



t a < t r > < t[. Since [V+i> ti-i] is a subinterval of I*, Y^j^r'+i % — ~~ fsp (Lemma 13. 16j) . If we 
merge I and /' by adding the interval [V+ij^z-i] into 7^], we save on the transaction cost 
of f sp , and so the total return will not decrease. We show that the MDD has not increased. 
Since C r < is a maximum in [ti>, t r /], the drawdown for all points in [V+i: U] is at most f sp . Since 
Ci is a minimum in [i/,t r ], we conclude that the drawdown for any point in [ti,t r ] is at most 
max{ f sp , MDD(I)}. Since MDD(T$f r \) > fsp-, we conclude that this merger does not increase 
the MDD. 

Note that fi(I) > otherwise we improve the return of T^ r \ by removing I, without increasing the 
MDD, ans so T^ r \ cannot possibly be optimal. Thus, without loss of generality, we assume that 
the return of I is positive. Suppose that I n 7^* = 0. Then, by adding I to 77", we strictly increase 
the return, a contradiction on the optimality of T*. Thus, every interval of T^j contains at least 
one interval of T* . Now consider the maximal prefix -P ma x 

of I that does not overlap with any 
interval of 77,* . Since we know that I contains some interval of 77* , we conclude that this maximal 
prefix must be adjacent to some interval of 7^*. By Lemma 13.161 this interval has strictly negative 
return, so removing it from I strictly increase the return of 7g^|, without increasing its MDD. 
This contradicts the optimality of l$£ r \ , thus, P max must be empty. Similarily, the maximal suffix 
of I that is non-intersecting with 7^* must be empty, concluding the proof. ■ 

Lemma 13.191 

Proof: If K = Kq, then 7^* itself is K-Sterling-optimal. If K < Kq, we show that if the number 
of trades made is less than K, we can always add one more interval without decreasing the Sterling 
ratio of the strategy. First, note that 7^ cannot contain all the intervals of 7^*, as otherwise (by 
the pigeonhole principle) at least one interval I = [ti, t r ] of 7^ contains two consecutive intervals 
h — fan in] an d I\ = [ti 2 ,t r2 ] of T*. The region between these two intervals has return less than 
— f sp (Lemma I3.16|) . so breaking up I into the two intervals [£;,in] an d [i/ 2 , ir-] will strictly increase 
the return, without increasing the MDD, contradicting the optimality of 7^1 ■ If 7g^| does not 
contain some interval of T* , then by adding this interval, we do not decrease the return or the 
MDD (Lemma 13. 16p . since the MDD is already > f sp . ■ 

Proposition 14.71 

Proof: If J is a prefix or suffix of I and n(J) < 0, then deleting J from I gives at least as much 
return, with smaller sum of squared returns, contradicting the SSR-optimality of I. Suppose that 
I = L U J U R where L and R are nonempty subintervals of I. If ^TjfeprTm < ~^Tn) ' * nen 
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Lemma 14. 1\ 



c m _ -fs P + ^{L) + {i{J)+ n{R) -f sp + fi(L) _ 
{> s 2 (L) + s 2 (J) + s 2 (R) s 2 {L) {) K) 



This contradicts the optimality of /, so we have 



H(J) + n(R) > -f sp + n(L) 



s 2 (J) + s 2 (R) ~ s 2 (L) ' 
Now, suppose that /i(J) < —f sp . Using eq:* and Lemma I4TT1 we find that 

»(J) + riR) . -fsp + ^R) -fsp + KR) = r (m 

1 ' - s 2 (J) + s 2 (R) s 2 (J) + s 2 (R) s 2 (R) [ h 

because s 2 (J) > 0. This contradicts the SSR-optimality of /, so /x(J) > — f sp - ■ 

Proposition 14.81 

Proof: If Jo is contained in some interval of T, then there is nothing to prove, so suppose that 
T UIq^T, and that T contains d > 1 trades Ji, ... ,J d . Note that S(T) = A(T)/B(T). If J and 
T do not intersect, then T' = h U T = {io,ii, • • • ,!<*}■ A(T') = A(T) + A(J ) > A(T), because 
A(I ) > 0. Since J is SSR-optimal, S(J ) = $7$ > ^ = S ( T )' so b y lemmaSU 

, ACT) i4(T) + A(/o) A(T) 
lM 5(T') B(T) + B(I ) ~ B(T) { h 

Suppose that Tfl/o ^ 0. We can decompose T into four parts (each part could be empty): 
T = Si U 5*2 U I\ U I r , where S\ contains intervals that do not intersect with Jo, S2 contains intervals 
that are contained in Iq, I\ is not contained in Iq but overlaps Iq on the left, and I r is not contained 
in Iq but overlaps Iq on the right. T' = IqUT = S\ U I\ U Iq U I r , i.e., adding Iq combines all 
the trades in {S2,Ii,I r } hito one trade. Since the internal regions of Iq have return at least —f sp 
and any prefix and suffix of Iq has positive return (Proposition 14. 7| ). we see that merging any two 
consecutive trades overlapping Iq decreases the number of trades by one, hence increases the return 
by fsp and the added interval loses at most f sp , hence this merge can only increase A(T). If either 
// or I r are empty, then we are addionally adding a prefix or suffix of Iq without changing the 
number of trades, which also increases A(T), thus we see that -A(T') > A(T). 
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Let's introduce the following definitions, 

A l = A(s 1 ) + -(fi(i l ni^) + f x(i r ni^)) 

a 2 = A(s 2 ) + -(fi(i l niQ)-f sp + t i(i r nio)-fsp) 

n 

By = B(5i) + i(a 2 (J,n^) + s 2 (I P n^)) 
n 

B 2 = B(S 2 ) + -(s 2 (iir\i ) + s 2 (i r ni )), 

n 

where Jo is the complement of Iq. Letting Aq = A(Iq) and Bq = B(Iq), we then have 

S(r) = s7TlV andS(r) = S4TlV 

Note that S(S 2 U (I t D Iq) U (J r fl 2q)) = |§, so by the optimality of J , || < ^. We show that 



i/x^n/o) < A _ A &VrM ) ^ A 



Hii anc I ' _ y < izi (•**") 

± s 2 (/*ni ) - £?o' ^ 2 (/ r n/ ) "So' v ; 

If not, then suppose (for example) that T^rpj^j > Then, 

contradicting the SSR-optimality of Jo- Again, by the optimality of Iq, S(S±) = grg^ < 5^ = 
S(/o)- it also has to be sharper than strategy Si. Thus, using (|**[) and Lemma 14.11 we have that 
jj^ < ^j-- Because A 2 is obtained from the returns of a collection of subintervals of Iq, it follows 
from Proposition 14.71 that A 2 < Aq- Now suppose that S(T) > S(T'), i.e., 

(A 1 + A 2 )(Bi + So) - (Ai + A )(Bi + B 2 ) > 0. 

Since A2 < Aq, it follows that B 2 < Bq. Rearranging terms in the equation above, we have that 

A 2 Si - AiS 2 Aq _ A x +A 2 

Bq{Bi + B 2 ) Bq Bi+B 2 

A 2 A 1 + A 2 A 2 B X - A X B 2 



> 



B 2 B\ + B 2 B 2 {Bi + B 2 ) 



Since S(Iq) > S(T), the first inequality shows that A 2 Bi — AyB 2 > 0. The second inequality then 
implies that B 2 > Bq, a contradiction. ■ 
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